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C. E. LAPPLE 


I. Introduction 

Industrial operations, particularly chemical and metallurgical, deal ex¬ 
tensively with particulate systems, for example, powders, aerosols, fluidized 
beds, ores, paints, and insecticides. In such systems, electrostatic phenomena 
can play many important roles: (1) they can markedly affect the behavior or 
properties of the materials being handled, as in conveying, grinding, and 
screening; (2) they can be utilized as direct processing means as in electro¬ 
static printing, atomization, minerals separation, or aerosol precipitation; and 
(3) they may be used to control, modify, or trigger other processes, both 
detrimentally and beneficially, as in setting off dust explosions, in causing 
adhesion of films and fibers, and in the controlling of spray deposition. 

In the past, many of the electrostatic phenomena have been treated as 
unavoidable nuisances, and many processing problems have been blamed 
rather casually on electrostatic influences. Only in recent years has the scope 
of the role of electrostatics and its potential for application in processing 
begun to be more fully understood. 

It is the purpose of this chapter to summarize knowledge dealing with the 
role of electrostatic phenomena in particulate systems, especially in aerosols. 
A relatively comprehensive annotated bibliography by Blake and Lapple (BIO) 
has surveyed the bulk of the literature through 1965. This chapter cites and 
summarizes the results of the more important or directly relevant work in this 
field, 1 However, before considering specific phenomena, it is desirable to 
review some of the fundamental principles in the field of electrostatics. 


II. Electrostatic Fundamentals 

Bodies that acquire a net electric charge because of the loss or gain of an 
electron or molecular ion are said to be charged. The charge is said to be 
unipolar if all particles have the same sign of charge, and ambipolar if par¬ 
ticles of both signs are involved. 2 The terms homopolar and heteropolar refer 
to the uniformity or nonuniformity of the magnitude of the charge on the 
component particles. 

1 Although aimed at different objectives, the processing problems of Edgewood Arsenal, 
for whom this study was originally conducted, are in many respects very similar to those 
of the process industries. While this chapter has been written with the process industries 
in mind, its origin has resulted in a somewhat greater emphasis on the aerosol particulate 
system than might otherwise have been the case. 

2 In some literature the terms “monopolar” and “bipolar” are used, but these may 
be mistakenly thought to indicate the magnitude of the charge, especially when referring 
to ions. 
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In the presence of an electrostatic field, all materials become polarized; 
that is, they acquire a dipole resulting from orientation of their molecules 
by the field or displacement of their free electrons. This dipole tends to 
counteract the imposed electrostatic field and is directly related to the dielec¬ 
tric constant of the material. In normal gases (dielectric constant = 1), the 
polarization effect is negligible. It is a maximum in conductors, in which the 
polarization is actually the result of migration of free electrons rather than 
being simply an orientation of molecules. In most materials (always with 
conductors), the polarization will disappear when the field is removed. In 
some materials, called electrets, the dipoles induced by an electrostatic field 
will remain even after the field is removed, although they may disappear after 
long periods of time. In other materials (e.g., ferroelectrics), stable dipoles 
tend to form spontaneously, even in the absence of electrostatic fields. 
Polarized bodies will have an electrostatic force field around them because 
of the dipoles but will not have a net charge unless it is acquired by other 
means. Polarization processes may play a significant role in some processes 
(e.g., electrostatic atomization) and are considered in those instances. The 
bulk of this chapter, however, is concerned with the phenomena related to 
particles with net electrical charges. 

The magnitude of charge on a particle can be expressed in any one of 
several ways: total charge (J p ), surface charge density (J p5 ), specific particle 
surface gradient or field (<f ps ), charge-to-mass ratio ( R c ), or specific particle 
potential ( E ps ). These are related by the following identities: 

<p — ^ps _ &ps _ __ Pp Dp Rc / j \ 

ps D p eS nzdDp 6e<5 1 ' 

where D p is the diameter of particle; p p , the density of particle; e, the per¬ 
mittivity of free space, a universal constant; and <5, the dielectric constant of 
the surrounding medium. Of these terms, l p and R c are always physically 
real quantities. The rest of the terms have real physical significance under 
certain circumstances. For example, $ ps is the potential gradient or field 
intensity existing on the outside surface of a particle only if the charge is 
distributed uniformly around the surface of the particle. However, in any 
case the terms <g ps , J ps , and E ps , are defined in terms of £ p , or R c by Eq. (1), 
regardless of their physical meaning. 

Electrostatic phenomena are a function of particle properties as well as 
particle charge. Of particle properties, the one most important and subject 
to the widest variation is particle size. 3 Consequently, it would be most con- 

3 Particle size will always be expressed in terms of diameter rather than radius in this 
chapter. In all developments in this chapter, the particles will be assumed to be spherical. 
Consequently, when applying the results to other than spherical particles, that equivalent 
spherical diameter must be used which would correspond to the phenomenon involved. 
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Fig. la. Relationship between specific particle surface gradient and charge-to-mass 
ratio for various particle diameters. 


venient to express particle charge in that combination with particle size which 
most nearly is a direct measure of the phenomenon. While this combination 
varies with the phenomenon, it is believed that the term & ps is a direct measure 
of the most frequently encountered phenomena. For this reason all particle 
charges are reported in terms of S ps in this chapter, usually using the units 
of volts/micron. Since particle charge is often measured in terms of charge- 
to-mass ratio, R c , the relationship between R c and S ps is shown in Fig. la 
as a function of particle diameter. 

In order to provide a better means for visualizing the physical significance 
of both the specific particle surface gradient, g ps9 and the specific particle 
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Fig. lb. Relationship between specific particle surface gradient and number of elec¬ 
trons on particle for various particle diameters. 


potential, E ps , Figs, lb and lc give the particle charge, in number of electrons, 
for various particle sizes corresponding to various levels of g and E ps . 
Figures la-lc are simply graphical portrayals of Eq. (1). 

In this chapter all equations are written in a form for which any consistent 
system of units may be used. The table of nomenclature gives illustrative 
units in the MKSA system. By definition, 

F=SSL (2) 

where F is the force acting on a point charge SL situated in an electrostatic 
field of intensity S. In the rationalized form, the basic electrostatic force 
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Fig. lc. Relationship between specific particle potential and number of electrons on 
particle for various particle diameters. 

equation (Coulomb’s law) is 


F = SMlmdL 2 (3) 

where F is the force between two point charges, and > distance L apart 
in a medium of dielectric constant S. 

Davies (D2) has given a very brief review of some of the recent literature 
related to electrical properties of aerosols. 
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HI. Effects of Electrostatic Charges 
A. On Material Properties 

The presence of an electrostatic field subjects a material to a stress. Such 
fields can arise from the presence of free charges in the material or from the 
action of an external field. A number of authors (Hogan, HI 2, HI 3; Graf, G8; 
Doyle, Moffett, and Vonnegut, D6; and others) give the outward pressure on 
a liquid sphere with a uniformly distributed surface charge as 



which assumes that the sphere is in a medium with unit dielectric constant. 
Thus a value of S ps of 1 V/micron, would correspond to an outward electro¬ 
static pressure of 44 dyn/cm 2 or 0.00064 psi, which is a comparatively small 
pressure. 

For comparison, the constraining pressure due to surface tension, a, is 
given by 



For a 1-micron-diam water particle, p a will be 2.88 x 10 6 dyn/cm 2 or 42 psi. 

The effect of electric charge on the vapor pressure of a liquid is given by 
the classic equation of Thomson as quoted by Gallily and Ailam (Gl) and 
Vonnegut, Moffett, Sliney, and Doyle (V3) 



where p p is the equilibrium vapor pressure from a drop of diameter D p of a 
liquid of density p p , whose vapor pressure from a flat surface under the same 
total pressure as that of the fluid surrounding the drop would be p at the 
same temperature. The density p g is that at the absolute pressure p T and 
temperature T of the pure vapor of the liquid. The terms p c and p e are the 
pressures due to surface tension and surface charges as given by Eqs. (5) 
and (4), respectively. This equation is derived for a conducting liquid in a 
medium, devoid of external force fields, with a dielectric constant of unity. 
Surface tension results in an inward pressure on the liquid, which is partly 
counteracted by the presence of a charge at the surface. Thus, surface tension 
will cause an increase in vapor pressure but only because of the curved sur¬ 
face. This effect does not normally become significant until the drop diameter 
is less than 1 micron. The effect of a charge on the surface is to reduce vapor 
pressure and hence counteract the effect of surface tension. Gallily and Ailam 
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(Gl) have also derived a similar modified equation for the case of a drop of 
conducting liquid surrounded by a gaseous suspension of both neutral and 
charged particles, enclosed in a spherical container. 

From the illustrative value given for p e above it is apparent that electro¬ 
static effects on vapor pressure will be negligible unless the particle charge 
level is orders of magnitude greater than 1 V/micron. 

The discussion above has considered the effect of free charges and surface 
tension on vapor pressure. By exposing a polarizable liquid surface to an 
electrostatic field, similar effects can result because of the surface charges 
created by polarization. Hogan (H12, H13) considers these effects, including 
the effect due to electroconstriction. Although the effect of a free surface 
charge is always to exert an outward pressure, an inward pressure can exist 
if a sufficiently large tangential field intensity exists at the surface of a polariz¬ 
able liquid. In the absence of tangential stresses, polarization effects can be 
allowed for by multiplying the right-hand side of Eq. (4) by a factor that is a 
function of the properties of the material. As indicated by Schultz and Wiech 
(S3), this factor may range from 0.55 for nonpolar unassociated liquids to 
unity for polar highly associated liquids. 

There is also the possibility of having surface tension affected directly by 
the presence of an electrostatic field. To some extent this will be a matter of 
definition since the outward pressure due to a surface charge could be defined 
as an apparent effect on surface tension. Hurd, Schmid, and Snavely (HI5) 
measured the surface tension of water and water solutions when fields up to 
0.7 V/micron were applied across the air-solution interface. The results 
showed a reduction in surface tension of less than 1 %. These data must not 
be considered conclusive, however, because insufficient details are reported 
to permit assessment of the exact nature of the electrostatic field applied or 
of the validity of a number of corrections that had to be applied but were 
reported to be very large and difficult to apply. 

Electrostatic charges or force fields could also influence other properties 
such as viscosity and freezing point, but no attempt has been made to extend 
the review to these areas. 

B. On Containment or Bulk Storage 

In storing a precharged powdered material, one must consider the rate 
and manner in which electric charge may be lost. This charge may be lost by 
conduction through the powder to the container and then either by con¬ 
duction to the surrounding supports or by corona to the atmosphere. 

It can be readily shown that for a volume of unipolar and uniformly dis¬ 
tributed charged particles bounded by a conductive surface equidistant from 
a center of symmetry [either a plane (flat volume); a line (cylindrical volume); 
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or a point (spherical volume)], the following describes the rate at which the 
volume will lose its charge 


t — £<5^0 \n(3 0 /l) (7) 

where t is the time for the total charge within the volume to drop from 1 0 
to 1 for a material of resistivity and dielectric constant S v . If the powder 
is ambipolar, Eq. (7) will still apply if £> 0 and SL are considered as net charges. 
This is essentially a generalized form of the analysis given by Shaffer (S5). 
Values of S v & 0 corresponding to various rates of charge loss are shown 
in Table I. 


TABLE I 

Value of 8 v & 0 Corresponding to 
Various Relaxation Times 


Relaxation time 41 

tR 

8. Sto 
(Q)(cm) 

1 msec 

1.129 X 10 10 

1 sec 

1.129 x 10 13 

1 min 

6.77 X 10 14 

1 hr 

4.06 XlO 16 

1 day 

9.75 X 10 17 

1 year 

3.56 X 10 20 


a Relaxation time is defined as the 
time for the charge to drop by a factor 
of e, i.e., for ln(J2 0 /^) = 1, following 
the terminology used by Vonnegut et 
ah (V3) for the group eS t ,^? 0 in a study 
of another but related phenomenon. 

From these values it is apparent that very few materials could retain a 
significant fraction of their original charge for many days. Most materials 
could retain their charge for only a matter of minutes or less. Although 
Eq. (7) is derived for a specific type of geometry, it should also be applicable 
to give the correct order of magnitude for the time required for a deposited 
particle to lose or acquire charge in the absence of a superimposed field. In 
that case would include any surface resistivity. 

An assemblage of even lightly charged particles can result in very high 
electrostatic field intensities at the surface of the assemblage. It is readily 
shown that for a volume of unipolar and uniformly distributed charged 
particles bounded by a surface equidistant from a center of symmetry, the 
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field intensity at the outer edge of the volume is given by 


6c p {VjA)S ps _ 6mS ps 
PpDp Ap p D p 


= (for spherical assemblage) 


CpDfps 

Pp D p 


( 8 ) 


where c p is the mass concentration of particles of average size D p , average 
density p p9 and average net charge corresponding to £ ps in a volume V 
bounded by a surface of area A. The total mass of particles in the volume V 
is m, and D is the diameter of the spherical volume. This is also a generalized 
treatment of the analysis given by Shaffer (S5). 

For powders, c p /p p is approximately one-half. If we further assume that 
the outer edge of the volume will go into corona when £ exceeds 3 V/micron, 
which is the normal electrical breakdown strength of a flat surface in air, 
Table II shows the maximum size containers that can be used for various net 
levels of particle charging without having the charge lost by corona. 


TABLE II 

Illustrative Values of Maximum Allowable 
Container Diameter for Various Particle Charge 
Levels 

(for «? < 3 V/micron; c p jp p = 0.5) 


Specific particle surface 

Maximum allowable 

gradient, £ ps , 

container diameter 

V/micron 

D/D p 

1 

6 

0.1 

60 

0.01 

600 

0.001 

6000 


Thus, even at the very low particle charge levels of 0.001 V/micron (corre¬ 
sponding to an average charge of 0.2 electron on a 1-micron particle), the 
container cannot exceed 6000 times the particle diameter or 6 cm for a 10- 
micron particle. 

The above confirms the conclusions reached by Shaffer (S5) that unipolar 
precharging of powders is impractical because of considerations of charge 
loss during storage. 

C. On Aerosol Properties 

A cloud is essentially an aerosol with mobile boundaries. The stability 
of a cloud involves two distinct aspects. Because of Coulomb, image, and 
diffusion forces, particles in a given volume will flocculate. This will result 
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in a lower number concentration but the same mass concentration of particles 
(provided no loss occurs by diffusion to bounding walls or by settling out 
under the action of gravity). This can occur even with uncharged particles 
(by either Brownian motion or turbulence flocculation). If the cloud contains 
a net charge of one sign, the cloud will tend to expand because of the net 
repulsive forces set up within the cloud. This will result in a decrease in both 
number and mass concentration and will be superimposed on flocculation 
effects. The following discusses the role of these phenomena in determining 
the behavior and properties of an aerosol (or cloud). 

1. Cloud Expansion and Cloud Properties 

For a cloud of unipolar and uniformly distributed charged particles 
bounded by a surface equidistant from a center of symmetry, the rate of 
cloud expansion due to the repulsion of the charged particles within the 
cloud is given by 


-dc p 


'-drip 


’2/c c e S v c p £ p ; 

c p dt _ 


.n p dt_ 


Wp 


where c p is the mass concentration of particles at time t\ n p is the number 
concentration at the time t\ p is the gas viscosity; p p is the particle density; 
k c is the Stokes-Cunningham factor (a function of particle diameter and 
mean free path of gas molecules), and S ps is the specific particle surface 
gradient. The specific dilution time, t sd9 is defined by the above equation. 
This and all subsequent equations, while derived for a unipolar cloud, will 
still hold for an ambipolar cloud if S ps is defined in terms of the net charge. 
This is essentially a generalized version of the analysis presented by Foster 
(FI) and mentioned by Zebel (Z2). If d v is independent of c p9 Eq. (9) can be 
integrated to 


t = 4d[l - (Cplc p o)] (10) 

where t is the time for the cloud to expand from a concentration c p0 to a 
concentration c p . Thus the specific dilution time, t sd9 is actually the hypo¬ 
thetical time required for a cloud to expand from an infinite concentration 
of particles to a given concentration c p . For most practical purposes, t sd will 
also be identical with the time after initial formation for a cloud to drop to a 
concentration c p . 

The radial outward velocity of the edge of the cloud is given by 

VIA „ . . D 

u R =-— (for spherical cloud) -— 

tsd ®t sd 


(11) 
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where V is the cloud volume, A the total area of the edge of the cloud, and D 
the diameter of a spherical cloud. The field intensity at the edge of the cloud 
is given by an equation identical to Eq. (8). 

A plot of Eq. (9) is given in Fig. 2 for air at 25°C taking p p = 1 gm/cm 3 , 
d v = 1, and k c — 1. It is apparent from Fig. 2 that concentrations of the order 
of 100-1000 ^g/liter could not exist for a matter of more than a few seconds 
if & ps is greater than 1 V/micron. The above analysis allows only for cloud 
expansion due to the net charge on the particles. In practice, expansion rates 
will be even greater because of any added convective dilutions due to tem¬ 
perature or wind velocity gradients. It is, therefore, apparent that net charges 
greater than that corresponding to £ ps = 1 V/micron cannot be tolerated from 
considerations of cloud stability for applications requiring aerosol concen¬ 
trations greater than 100 /xg/liter. 

It is interesting to note from Eq. (11), that a 60-ft-diam field cloud of 
concentration 100 /ig/liter with a charge corresponding to & ps = 1 V/micron 



Fig. 2. Rate of expansion of charged clouds. 
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would be expanding at a radial velocity of 1 ft/sec. Prior to that point, the 
cloud would be expanding at a greater velocity. 

Fuchs (F4, pp. 102-105) also points out that, for a unipolar aerosol, the 
rate of concentration change, as given by Eq. (9) or (10), is an intrinsic 
property of the aerosol, depending on neither size nor shape of cloud. Some 
of the other relationships [Eqs. (8) and (11)], however, are subject to the 
cloud symmetry specified above. Fuchs also points out that in an ambipolar 
cloud the particles of minority sign tend to be driven toward the center or 
core where the cloud approaches neutrality asymptotically. The outer edge 
of the cloud tends to become unipolar with the sign of the initially most 
prevalent charge. 

Equation (8) is applicable to clouds of particles as well as powders. From 
Eq. (8) it is readily shown that even a lightly charged cloud would have a 
field intensity in excess of 10 V/cm (or 0.001 V/micron) at the outer edge. 
(For example, £ will be 15 V/cm, when p p = 1 gm/cm 3 ; c p = 1000 /ig/liter; 
D p = 10 microns; D = 50 ft; and S ps = 0.001 V/micron.) 

It is also interesting to note that natural atmospheric or terrestrial electro¬ 
static fields are of the order of 1 V/cm (Schonland, SI) in the regions near 
the ground, with the earth’s surface usually negative relative to the atmo¬ 
sphere. These fields are small compared with those resulting from even a 
lightly charged cloud. Although the small terrestrial fields may be negligible 
from the standpoint of aerosol dissemination, they may be significant in 
other atmospheric phenomena. 

There is a possibility that a cloud might lose charge by corona during its 
early concentrated stage. For example, if it is assumed that the maximum 
field intensity at the outer edge cannot exceed 3 V/micron (the normal break¬ 
down strength of air) without having corona ensue, then Eq. (8) would 
predict the relationships shown in Table III for a net particle charge corre- 

TABLE III 

Illustrative Values of Maximum Permissible Cloud Diameter 
for Various Particle Concentrations 
(for & pa — 1 V/micron; p p = 1 gm/cm 3 ; D p = 10 microns) 


Particle 
concentration 
in cloud, c p 
(fig/liter) (lb/lb air) 

Maximum permissible cloud diameter, 

D (without exceeding field intensity 
of 3 V/micron) 

(ft) 

10 6 

1 

0.1 

10 5 

0.1 

1 

10* 

0.01 

10 

10 3 

0.001 

100 
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sponding to 1 V/micron, a particle density of 1 gm/cm 3 , and a particle dia¬ 
meter of 10 microns. 

This would imply that the concentration of the initially disseminated cloud 
would be limited by its initial size as indicated above if no charge is to be 
lost owing to corona. However, it is not certain that corona would actualy 
ensue, especially where the cloud is expanding simultaneously. Even if there 
were some corona, the lost ions might be recaptured by the expanding cloud. 
Consequently, the above simply represents a conservative estimate of possible 
limitations on the initial cloud or aerosol jet. 


2. Optimum Charge 


In such applications as insecticide or pesticide spraying, riot control, or 
chemical warfare, it is the objective to expose objects within a given area to 
either an incapacitating or lethal dosage of an agent, as the case may warrant. 
If a cloud expands to cover a ground area A b in time t , then any object 
exposed at the edge of this area will have received a critical dosage at a later 
time t a provided 

[ c dt > (ct) c (12) 


where {ct) c is the critical dosage usually referred to as a ct value. All other 
objects within this area will already have received at least a critical dosage 
in time t a . If it is further assumed that the number of target objects exposed 
is proportional to the area A b and that the area A h is proportional to the $ 
power of the cloud volume at any time, it can be shown that there is an 
optimum value of S ps , which will result in a maximum number of casualties 
for any given time t a and total mass of agent released. The optimum is 
given by 


' P S opt 




2k c e 8(ct) J 


1/2 


(13) 


The actual number of casualties will, of course, depend on both the value 
of t a desired and the mass of agent disseminated. Assuming a value of 10 
(/ig/liter)(min) for ( ct ) c , the optimum value of $ ps will be approximately 
1 V/micron. This is, of course, a simplified analysis and various further con¬ 
siderations should be included such as allowances for target objects seeking 
shelter or protection, variation of {ct) c values with exposure time and particle 
charging, nature of agent dispersion prior to dissemination, etc. This also 
assumes that cloud expansion is governed solely by the electrostatic charge 
of the cloud and makes no allowance for other factors such as atmospheric 
turbulence. 
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3. Aerosol Stability 


Kunkel (K9) has derived expressions for the collision rate of charged 
particles allowing for both Coulomb attraction forces and ionic image forces. 
His final expressions, however, are not readily applied and he concludes only 
that the rate of growth of a particle over a period of several seconds is small if 


\£p£pa\ n p jqIO 

D p I 2 


(14) 


where 2L P is the charge of the particle in electrons; £ pa is the average charge 
of other particles of one sign; n p is the number concentration of particles, 
particles/centimeter 3 ; and D p is the average particle diameter in microns. 
Assuming an equal magnitude of charge on all particles, this can be written 
in the form 

$ls c P lpp < 100 (15) 


where S ps is in volts/micron; c p is in micrograms/liter; p p is in grams/centi¬ 
meter 3 . Thus for p p = 1 gm/cm 3 and c p = 1000 /ig/liter, i ps must be less than 
0.3 V/micron. Thus one would conclude from Kunkel that flocculation due 
to charge would be small in a period of minutes if were less than 0.01 
V/micron. 

A more thorough analysis of flocculation is given by Fuchs (F3) and by 
Zebel (Z1,Z2), both of whom arrived at essentially the same result. The 
flocculation rate can be written in the form 


“ {dn p I dt)jn p = k em K s n p (16) 

where K s is the flocculation coefficient in the absence of charge and k em is a 
correction factor to allow for both particle charge and particle mean free 
path. Fuchs and Zebel obtained for k em 


where 


= 


k = 


N. 


'-1 + N n 


JV = ^pl^pl 

9 2nz5kT(D pl + D p2 ) 
N me = N m N q e N < 




(D,i + Dpi) \ 


m 


m„,m 


’pi^pi 


™ pi + tn p2 ) J 


1/2 


(17) 

(18) 

(19) 

( 20 ) 


N q is a parameter to allow for charge effects and N m is a parameter to allow 
for slip between particles (as distinguished from slip between molecules) in 
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the collision between two particles of different sizes D pl and D p2 , diffusion 
coefficients & pX and 0 p2 , and masses m pl and m p2 . The term k em is the 
collision rate correction factor, which measures the combined effect of both 
of these factors, and k is the Boltzmann constant. Both Fuchs and Zebel 
independently obtained the N q term in Eq. (17). The N me term is a refinement 
taken from Zebel. 

When dealing with a monodisperse homopolar aerosol (i.e., one in which 
all particles have the same size and magnitude of charge), the coefficient K s 
takes the well-known form 


K s = 4k c kT/3n 

(21) 

„ ±-2 p 2 ± mSD p 3 & p S 

• “ 4ns5kTD p ~ 4k T 

(22) 

/16fc c 2 p.kr\ 1/2 

(23) 

m ~ ( Unn 2 D p ) 


where the plus sign in N q applies where unipolar charges are involved (all 
particles the same sign and repelling each other), and the minus sign applies 
to the ambipolar case where the particles attract. The term N m will always 
be negligible for D p > 1 micron. 

The following special cases of Eq. (17) should be noted: 


(a) Particles uncharged: 

fc = 1 
em 1 +N m 

(24) 

(b) Large ambipolar charge: 

Km = W 

(25) 

(c) Large unipolar charge: 



Km 

N q 

e N "(l + N q N m ) 

(26) 

In the derivations of the above equations, image forces (which are likely 
to be small compared to the Coulomb forces) were neglected and every 
collision was assumed to result in adhesion. Although some have questioned 
the validity of the latter assumption, there is no evidence to indicate that this 
is not, at least nominally, the case when dealing with particles smaller than 
10 microns colliding at low velocities. While this subject of adhesion has 


never been investigated systematically, the bulk of all the data on aerosol 
flocculation and deposition would imply that for these conditions adhesion 
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is nominally complete. Zebel (Z2) also extends the above analysis to com¬ 
puter calculations for actual distributions. 

In order to illustrate the quantitative implications of the above equations, 
Figs. 3-5 have been prepared. Figures 3 and 4 represent a plot of Eq. (16) 
using Eq. (21) to define K s . For air at 25°C and k c = 1, K s is 2.98 x 10" 10 
cm 3 /sec. Figure 3 shows flocculation rate in terms of number concentrations 
while Fig. 4 shows it in terms of mass concentration. It should be remembered 
that for the flocculation considered here, the mass concentration will always 
remain constant; only the number concentrations will change with time 
because of flocculation. Any change due to cloud expansion, which will 
cause mass concentrations to change, is a separate effect that would be super¬ 
imposed on the above. If k em in Figs. 3 and 4 is taken as unity, the figures 
will essentially give a direct representation of flocculation rates in the absence 
of electrical charges. It will be noted that at a concentration of 10 6 particles/ 
cm 3 , the flocculation rate is only about 2%/min. The common statement 
that an aerosol must have a concentration of less than 10 6 particles/cm 3 in 
order to be stable simply means that such an aerosol will be stable for a 
period of many minutes without significant flocculation. 

Figure 5 shows the effect of particle charge on flocculation rate. It is 
based on Eq. (17), basing N q on Eq. (22) and for N m = 0. This figure shows 
that a unipolar charge causes a very rapid reduction in k em , and hence in 
flocculation rate, while the rate of increase in flocculation rate due to ambi- 
polar charge is not as rapid. Figure 5 shows only the relative effect of charge 
on flocculation rate for various charge levels and particle sizes. It implies a 
very large relative effect for large particles but tells nothing about the absolute 
rate directly, since the particle flocculation rate decreases rapidly with par¬ 
ticle size for a given mass concentration. The absolute effect can be obtained 
by considering Figs. 4 and 5 simultaneously. 

For ambipolar charging, k em can be approximated by 

Km = 1 + N q (27) 

This will overestimate the value of k em by as much as 30% in the range of N q 
between 1 and 4 when N m is neglected. The difference between this approxi¬ 
mation for k em and the exact expression as given by Eq. (17) is best seen by 
comparing the dashed line in Fig. 5 with the solid line for ambipolar charges 
at S ps = 0.001. The same spread in the curves would exist at all other values 
of tfps . 

Thus, within this precision, it is apparent from Eq. (16) that the floccula¬ 
tion rate is an additive function of ordinary diffusion or Smoluchowski 
flocculation and electrostatic attraction. The ordinary diffusion rate is given 
by Eq. (16) for k em = 1, whereas the flocculation rate due to electrostatic 
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Fig. 3. Effect of particle number concentration on flocculation rate. 

effects alone for an ambipolar homopolar monodisperse aerosol is given by 

-drip ] = 2k c zbc p £* ^ 

n p dt J e np p 

This result is obtained by substituting the values of K s and N q from Eqs. 
(21) and (22), respectively, in Eq. (16) for k em - N q . It will be noted that 
this equation is identical to Eq. (9). It should be remembered, however, that 
Eq. (9) is the expression for the rate of change of concentration due to cloud 
expansion from net unipolar charging while Eq. (38) gives the rate of floccu¬ 
lation of particles within a given highly charged ambipolar cloud. Neverthe¬ 
less, this identity permits one to also use part of Fig. 2 to evaluate Eq. (28). 
It is interesting to note from Fig. 2 that for a concentration of lOOO^g/liter 
and a particle charge corresponding to S ps = 0.3 V/micron, the flocculation 
rate is 10 %/sec, which agrees reasonably well with the value obtained from 
Kunkel [Eq. (15)]. Equation (28) or Fig. 2, however, offers a more quantitative 
basis for assessing flocculation rate. 

For nonconducting equal spheres with equal surface distribution of charge, 
the following equation can be derived, which will give either (1) the time for 
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Fig. 4. Effect of particle mass concentration on flocculation rate. 


particles of opposite charge to coalesce if they are initially a distance L apart 
from center to center; (2) the time for particles of the same sign of charge 
that are initially almost in contact to be repelled to a distance L from center 
to center 



(29) 


In deriving this equation it was assumed that particle acceleration is a negli¬ 
gible factor, which is true if t is large compared to the specific stopping 
time, t ps . 
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PARTICLE DIAMETER , D p , microns 


Fig. 5. Effect of particle charge on flocculation rate. 


The relationship between particle spacing and mass concentration for 
particles in a cubical array is given by 



(30) 


Table IV indicates the magnitude of particle spacings corresponding to various 
concentrations assuming a cubical array and a particle density of 1 gm/cm 3 . 

If we calculate the mass concentration, c p , which would correspond to a 
cubic array of particles with a center-center distance of L, Eq. (29) can be 
written in terms of concentration to yield 


t = 


r w P 

\ 3 / \2k c z&c p £ ps 


(31) 


for the case where L is large compared to D p . Thus t is the time for a pair of 
particles to expand to or coalesce from a spacing corresponding to a con¬ 
centration c p . As might be expected, this is of the same form as Eq. (9) or 
(28). The difference is in the proportionality constant, which reflects the fact 
that Eq. (31) deals with individual particles on a specified path rather than a 
statistical migration between random particles. 
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TABLE IV 

Spacing between Particles for Various Particle Concentrations 
(for cubical array and p p = 1 gm/cm 3 ) 


Particle 

concentration 

Particle spacing 
in cubical array 

C L/D p ) 

32.8 Ib/ft 3 or 0.524 gm/cm 3 

1.000 (particles touching) 

1 lb/ft 3 gas 

3.20 

1 lb/lb gas or 1 gm/gm gas 

1.62° 

1 mg/cm 3 

8.06 

1 grain/ft 3 

61.2 

1 gm/meter 3 or 1000 /xg/liter 

80.6 

1 mg/ft 3 

246.0 

1 grain/1000 ft 3 

612.0 

1 /xg/liter or 1 mg/meter 3 

806.0 


a Based on air at 1 atm, 25°C. 


4. Floe Properties 

Having considered the factors that govern the extent or rate of flocculation 
in an aerosol, it is also important to consider the effects of flocculation on 
the properties of an aerosol. Consider a large sphere of diameter D pl , with 
its surface covered with small particles of diameter D pi . If the adhering 
particles are no more than one layer thick, it can be shown as a good first 
approximation that 



where n pl is the number of D pl particles adhering to the surface; D pf is the 
floe diameter; D pSfa is the apparent size of the floe (defined as that diameter 
of particle having the same density as the individual particle that will have 
the same Stokes’ settling velocity as the floe); and k d is a distribution factor. 
The above will be valid only if n pl is very large so that a large fraction of the 
surface will not be bare, and D p2 is very large compared to D pl . If the particles 

3a Assumes all particles have the same density. 
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are arranged on a tight square array, k d = 1. If the adhering small particles 
are more widely spaced, k d will be greater than 1. Thus, from Eq. (34) it is 
apparent that if the large particle is ten times as large as the small ones, the 
apparent size D pSfa will be only 6 % larger than D pl for an assumed tight 
square array. If fewer small particles adhere than would correspond to a 
tight square array, k d could become greater than n/2. This would make 
D pSfa less than Dpi and the adhering small particles would be exercising a 
buoyancy effect on the larger particle (in general, the small particles would 
serve to reduce the mobility of the large particle). Data are not available by 
which to assess this effect quantitatively; in any event, the limiting effect will 
probably be small. The purpose of the above discussion is to point out that 
even where a large particle is entirely covered with small ones, its settling 
velocity (or its apparent size, D pSfa ) will not be increased by a significant 
factor. The following relates this phenomenon to the electrostatic flocculation 
between large and small particles. 

Consider the deposition of small charged particles (D pl ) on a large par¬ 
ticle ( D p2 ) of opposite charge. When the large particle is completely neu¬ 
tralized, the final floe will contain n pl small particles and have a mass, m pfi 
given by 



™ P f = ™ p2 



(36) 3a 


where m pl is the mass of the large particle, and $ psl and $ ps2 are the corre¬ 
sponding charge levels of each of the respective size of particles. If g psl is 
of the same order as $ ps2 (i.e., if the particle charge is essentially propor¬ 
tional to particle surface), which is a likely case, it is apparent that n pl will 
be of the order ( D p2 /D pl ) 2 . From the previous paragraphs and Eq, (32), it 
will be observed that this number is not greater than that needed to cover the 
surface of the large particle. Also, the total mass of material associated with 
the small particle needed to neutralize the large particle is small compared with 
the mass of the large particle. Thus, neutralizing a large particle will require a 
large number but little mass of small particles and will not have a significant 
effect on the settling velocity of the large particle. Thus, there will be little 
effect on the aerosol in terms of its settling velocity distribution on a mass 
basis. On a number basis (which might be of importance in biological warfare 
applications), the effect would be large. 

If the small particles did not deposit as a coating but tended to form 
chains anchored to the large particle, the physical floe diameter would be 
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effectively very much larger but its effective size from the standpoint of 
settling velocity would be less. This is a case where the small particles would 
undoubtedly serve to reduce average particle mobility. The greatest effect of 
this condition would be in connection with those deposition mechanisms that 
depend on physical interception. Chainlike particles should be deposited 
more efficiently in the alveoli of the lungs, but fewer such particles should 
reach the alveoli. It is not certain what the net effect on alveoli deposition 
would be, although the total deposition in the respiratory system should be 
increased. 

For floes of a few particles of the same nominal size, Kunkel (K10) has 
reported the data given in Table V. 


TABLE V 

Data on Effective Size of Flocs" 


Floe 

arrangement 

Number of particles 
in floe 

Ratio 

/ effective floe diameter \ 

\discrete particle diameter/ 
(Dps/JDp) 

Particles in line 

1 

1.02 


2 

1.18 


3 

1.26 


4 

1.28 


8 

1.37 

Particles in a plane 

3 

1.29 


7 

1.54 

Clustered in space 

6 

1.59 


* From Kunkel (K10). 


From Kunkel’s data it is apparent that a fair number of particles can 
adhere together without radically changing the effective size. Floes formed 
as the result of attraction due to electrical charges cannot be composed of 
very many particles of the same size because of their tendency to neutralize 
each other. Where, because of low conductivity, actual neutralization is a 
slow process, the floe may consist of dipoles. While these dipoles are still 
capable of attracting particles, the force fields around such floes become 
rapidly weaker owing to the net neutralization that results even if localized 
areas on the floe are both positively and negatively charged. 

From the above it is concluded that flocculation due solely to electro¬ 
static attraction cannot have too great an effect in increasing the effective 
particle size because of the limits inherent in such flocculation. 
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5. Particle Stability 


As discussed previously, the presence of an electrostatic charge or force 
field results in a stress within the material. In the absence of polarization, this 
stress will always tend to rupture a particle. This tendency to rupture is 
resisted either by the action of surface tension or by the molecular bonds or 
tensile strength. 

In the absence of polarization effects, if the material can flow, as in a 
drop, and if the stress is applied slowly enough, an equilibrium is established 
with surface tension. When any perturbation is applied to this system, 
vibrations are set up with frequencies given by 


_ ( 2 n(n - 1) 
WPpD* 



£L 2 1 
2n 2 sD p 3 


; n > 2 


(37) 


where / is the frequency of the mode of vibration for which the departure of 
the drop from sphericity is described by the surface zonal harmonic 4 of 
integral order n, and a is the surface tension. When the bracketted term 
becomes zero or negative, the system becomes unstable and drop breakup 
ensues. These relationships were first evaluated by Rayleigh (R3) and are 
summarized by Hogan (HI2). 

Substituting for 2, p from Eq. (1), this breakup ensues when 


' ps 


2 (n + 2 )<x 
eD D 


(38) 


This has come to be known as the Rayleigh instability. The lowest admissible 
value of n is 2, for which 


8a 


f > 

PS bD d 


(39) 


It will be noted that when the outward pressure due to electrostatic stress 
[Eq. (4)] is made greater than the inward pressure due to surface tension 
[Eq. (5)], the same relationship as Eq. (39) is obtained. Hogan (H12) also 
reports that Schneider found that for liquid emitted from the end of a capil¬ 
lary only modes corresponding to odd values of n are permitted. Hence, 
since 3 is the lowest admissible value of n for this case 


<8% > 10 <j\zD p 


(40) 


4 A surface zonal harmonic of the first kind of order m and argument z is the poly¬ 
nomial of the m’th degree in z given by (\j2 m m\) (d m jdz m ) ( z 2 — 1) M and usually denoted 
by P m {z). In describing waves on a spherical surface these so-called Legendre polynomials 
fill the part played by trigonometric functions in describing the waves on a stretched string. 
In such applications z is the ratio to the radius of the perpendicular distance of a point 
on the spherical surface to an equatorial plane. 
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Hogan (HI2) also writes Eq. (40) in terms of the drop potential 

E% > 5<rD p l2e (41) 

This is valid only if the drop is spherical and is remote from any other object. 
Actually the drop potential is likely to be sensitive to geometry and Eq. (41) 
may not represent a very valid conversion. 

Values based on Eq. (39) are given in Fig. 6 in terms of S ps and in Fig. 7 
in terms of equivalent charge-to-mass ratio for various values of surface 
tension. Actually the maximum value of S ps before breakup, as given by 
Eq. (39), is independent of pressure. The pressure term has been incorporated 
in Figs. 6 and 7 in order to permit direct comparison with later developments 
of instability due to corona. For present purposes, Figs. 6 and 7 will give a 
correct representation of Eq. (39) if all pressure terms (in the coordinates or 
in the parameter) are ignored or assumed to be unity. 

Experimental data reported by Doyle, Moffett, and Vonnegut (D6) and, 
more indirectly, by Hendricks (H6) essentially check the order of charge 
predicted by Eq. (39). 

When an electrostatic field is applied so rapidly that flow phenomena 
cannot occur (or in the case of a solid, which does not flow), breakup may 
not occur until the electrostatic stress exceeds the tensile strength of the 
liquid. Schultz and Branson (S2) and Schultz and Wiech (S3) claim that this 
is the case in their liquid atomization studies. For this case, at breakup, 

> 2t> (42) 

Since for liquids x y is of the order of 2 x 10 9 dyn/cm 3 , $ ps must be of the 
order of 7,000 V/micron by this mechanism. Schultz and Wiech suggest that 
filaments of liquid may first be formed by Rayleigh instability from which 
are torn finer drops by local electrostatic stresses exceeding the tensile 
strength. 

The field strengths required for the mechanism suggested by Schultz and 
Wiech to occur are of the order of those that cause extensive field emission. 
Their experiments were conducted at high vacuum and such charge levels 
could not be stable at atmospheric conditions. 

D. On Coalescence 

Vonnegut, Moffett, Sliney, and Doyle (V3), Berg, Fernish, and Gaukler 
(B7), and Lindblad (L7) have confirmed Rayleigh’s (R2) original reports that 
coalescence rates of droplets can be radically increased by means of electro¬ 
static charges. Vonnegut et al worked with the coalescence of a jet of drops 
from a needle. They concluded that the field necessary to coalesce the drop 
increased with an increase in the relaxation time (product of resistivity and 
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NORMALIZED PARTICLE DIAMETER, pD p , (atm) (microns) 

Fig. 6. Maximum stable charge on particles expressed as specific particle surface 
gradients. 

permittivity) of the material. Lindblad, dealing with water drops, found that 
it took 200-1400 msec to coalesce the drops in the absence of charge, the 
time being about twice as great at 100 % relative humidity as at 0 %. This 
coalescence time decreased as voltage was applied across the drops, being 
10-20 msec at 1 V. Berg et al found essentially the same results at 1 V but 
extended the data to over 10 V with various liquids. At 10 V the coalescence 
time was only about \ msec. 

Berg, Fernish, and Gaukler (B7) indicated that their data extrapolate back 
to zero coalescence rate at zero voltage although they did not actually 
explore the region below 1 V. Presumably on the basis of this extrapolation, 
Berg (B3) concludes that charging of aerosol particles can increase retention 
in the respiratory system by many fold because of the increased rate of 
coalescence. An examination of Berg’s data leads us to the conclusion that 
at zero voltage the coalescence time is considerably greater than 10 msec. 
However, it is difficult to consider the extrapolation to infinite coalescence time 
(i.e., zero coalescence) at zero voltage as anything more than a possibility. The 
data can be extrapolated back to confirm Lindblad’s findings just as readily. 

The available data on retention in the respiratory tract, as summarized 
by Mitchell (M6) indicate that upwards of 20% of inhaled aerosols are 
retained in the respiratory tract, approaching 100% for particles over 5 
microns in diameter. The order of retention follows reasonably well what 
would be expected from the physics of the various deposition mechanisms 
assuming that once deposited a particle is retained by the surface. This would, 
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Fig. 7. Maximum stable charge on particles expressed as equivalent charge-to-mass 
ratio. 


therefore, appear to refute Berg’s claim that charging is necessary to secure 
aerosol retention. It can, of course, always be argued that some stray charges 
are usually present to cause coalescence. This cannot be readily refuted. 
However, if this is the case, the low level of charge needed would appear to 
be so universally present that there need be no concern about lack of 
coalescence under normal conditions. 

The above comments on coalescence are aimed specifically at the area of 
deposition in the respiratory tract and are not intended for universal appli¬ 
cation. The same comments would probably apply to any case where rela¬ 
tively small particles (<10 microns) are deposited at relatively low velocities 
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(<1 ft/sec). For other conditions, involving more vigorous contact, it is 
known that rebound can occur. For example, it was indicated by Wright 
et al (W6) that the extent of adhesion of 1-micron ammonium bromide 
crystals was reduced as deposition velocities exceeded 1 ft/sec. 

E. On Evaporation 

It has previously been indicated that charges have comparatively little 
direct effect on the vapor pressure of a material until the particle diameter is 
substantially less than 1 micron and their effect is toward keeping the vapor 
pressure closer to that from a plane surface. Consequently, it may be 
concluded that charge will not significantly affect evaporation rate of a 
given droplet. 

The data of Doyle et al (D6) would indicate that a charged evaporating 
droplet cannot lose charge except with mechanical loss of material. As a 
droplet evaporates any charge remains until the charge concentration is high 
enough to result in Rayleigh instability [Eq. (39)]. At that point there is an 
emission of fine drops (less than 15 microns diam) from the parent droplet 
These drops are few in number (perhaps 10 from a 100-micron parent), 
represent a negligible part (say 3 %) of the mass of the parent, but carry an 
appreciable part (say 30 %) of the charge. This might be expected since the 
ejected fine drops presumably arise at the surface where the charges are 
concentrated. It seems likely that the ejected droplets have a charge level 
close to the maximum permitted by Rayleigh instability. In that case, they 
should almost immediately eject another series of drops as they in turn 
evaporate. 

Thus while the presence of electrostatic charge will tend to hasten the 
evaporation process because of the extended surface provided by the ejected 
drops, this is probably not a major consideration since the initial stage of 
evaporation of a droplet is actually the one that limits the over-all rate. The 
concentration of charge within the drop, however, means that evaporation 
of a drop can ultimately yield very highly charged small particles of residual 
nonvolatile soluble or suspended contaminants. 

F. On Deposition 

A number of investigators have indicated increased deposition rates due 
to electrostatic charges. LaMer et al (L2) and Gillespie (see Richardson, R5) 
were concerned with aerosol filtration. Gohlich (G5), using corona charging 
of air suspensions of powders, reports two- to threefold increases in field 
deposition rates as a result of the charges. The greatest improvements, how¬ 
ever, are reported by Vonnegut et al (V3) who reported nine- to 83-fold 
increases in deposition on surfaces as the result of electrostatic charges. In 
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this study they used both dc and ac corona charging of DOP (dioctylphtha- 
late) particles. Although they indicate that an ac charged cloud should be 
neutral, they do not report any check of this point. With ac corona there is 
usually some net rectification. 

Ambipolar charges can give increased deposition but such deposition 
depends for the most part on image effects. In this sense the rate of deposition 
will be comparable with the rate of flocculation of ambipolar aerosols. 
Improved deposition rates by such means are likely to be small. 

With unipolar (or net unipolar) clouds, however, the rate of deposition 
will be essentially the same as the rate of expansion of the edge of the cloud. 
In fact, with very large conductors or grounded objects, because of image 
forces (which will reflect those of the entire cloud rather than just those of 
an individual particle), the rate of deposition would be expected to be just 
twice as fast as the rate of expansion of the edge of the cloud. Thus, since the 
deposition rate will be the product of velocity and concentration, one may 
use Eqs. (9) and (11) to predict the following deposition rate from a net 
unipolar charged cloud: 


G de = 2k c e<5„ c 2 <S 2 ps Dl(3np„) (43) 

where G de is the mass deposition rate per unit of surface area exposed to the 
cloud. The factor of 2, to allow for image forces, has been incorporated in 
Eq. (43). Although D was originally defined as cloud diameter, this assumes 
a free cloud. Therefore, in applying Eq. (43), D will be essentially the apparent 
cloud size that the surface sees. For example, the top surface of the top 
leaves on a plant in the cloud will see the entire cloud. The under surfaces will 
see only that cloud between them and the next layer of leaves, so that D for 
that case will be radically reduced. This results because one layer of leaves 
will electrically shield the adjacent lower layer from the cloud above. Simi¬ 
larly, the deposition on top of a man’s head may be determined by the whole 
cloud diameter, whereas the deposition on his side will correspond to an 
effective diameter more nearly approximating his height. These are only 
approximations since the actual effects will depend on the field intensities as 
determined by the specific geometries involved. This can become an ex¬ 
ceedingly complex estimate to make. 

In order to provide some basis for comparison, the deposition rate per 
unit of surface area due to gravity settling is given by 

G dg = g(p p ~p)k c c p D 2 /(m (44) 

The ratio G de /G dg is, therefore, 

G de _ Uz5 v c p g 2 ps D 
G dg g p p (p p - P)D 2 


( 45 ) 
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For D = 3 in., c p = 1000 /ig/liter, $ ps = 1 V/micron, Z) p = 10 microns, 

= 1 gm/cm 3 , and S v = 1, the ratio G de /G dg will be 8.25. Thus, even for 
such a small exposed charged cloud, deposition by electrostatic means will 
be almost an order of magnitude greater than by gravity. The electrostatic 
mechanism will also result in deposition on the under side of the surface. 

A detailed analysis of the various mechanisms of electrostatic deposition 
has been presented by Kraemer (K6), Kraemer and Johnstone (K7), and 
Dawkins (D3). 

G. On Adhesion 

Corn (C7) has given a recent review of adhesive forces between particles 
in which he concludes that inadequate data on static electrification are avail¬ 
able to arrive at any definite conclusion as to the relative role of electro¬ 
statics in particle adhesion. A detailed evaluation of the analysis presented 
by Russell (R9), however, sheds considerable light on the subject. 

1. Electrostatic Force between Charged Particles 

Charged particles will exert a force on each other that depends on the 
magnitude of the charge and on the distance between particles. This force 
is given by the simple Coulomb law [Eq. (2)] for point charges or for charged 
particles that are far apart compared with their diameter. When particles are 
brought close together, however, the nature of the force depends on the 
nature of the particle as well. On particles of perfect insulators the charges 
will remain in their respective locations on the particle regardless of the 
proximity of another charged particle. With conductors, on the other hand, 
the charges can shift because of induction effects. The charges on a conducting 
particle will move relative to the center of mass of that particle, toward those 
of the opposite sign on a proximate particle. This will result in a greater force 
of attraction between the particles than would be calculated from the simple 
Coulomb law without allowing for migration of the charges within the 
particles. Where the proximate particle has charges of the same sign, the 
charges will move further apart resulting in a lower repulsive force than 
indicated by Coulomb’s law naively applied. The force fields around and the 
fields between spherical bodies have been extensively treated by Russell (R9). 
The force between particles can be expressed by a modified form of the 
Coulomb law. For conducting spheres of equal size and equal but opposite 
charge, the force of attraction is given by 
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where £ p is the magnitude of charge on each particle; L is the distance 
between centers; and k sA is a factor to allow for any inductive shifting of 
charges. This factor is a function of the relative distance between the particles 
[L/D p or LJD p , where L s is the clearance between particles and is equal 
to (L~D p )l 

The maximum field intensity around such particles will exist at the surface 
of each particle at the point of intersection of their line of centers and may 
be expressed as 


<^max = k s g$ ps = k sE (£/L s ) (47) 

where £ ps is a measure of particle charge as defined by Eq. (1), E is the 
potential difference between the spheres, and k sg and k sE are functions of 
L/D p or LJD P . 

The capacitance between such a pair of spheres is given by 


=2 p IE = 2nsk sV D p (48) 

For conducting spheres of equal size and equal and like charge, the force 
of repulsion is given by 


F R = k : 


sR 


V 1 

, \nsd£ 2 ps D p 2 ' 


SR L 4 (L/D p ) 2 J 


(49) 


where k sR is a function of L/D p or LJD P . 

Table VI shows values of k sA , k sR , k sS , k sE , and k s % as given by Russell 
(R9) for various particle spacings. 

Equations (46) and (49) would also hold for two equal spheres that are 
perfect insulators. If the charge is initially uniformly distributed within the 
sphere or on its surface, the value of k sA or k sR for such insulator spheres 
would be unity regardless of spacing since the charges cannot migrate. This, 
however, assumes that no polarization has taken place. 

Equations (46) and (47) may be combined with the definition of Eq. (1) 
to yield 


F A = k 


FS 


nz8D n 1 i i 


2 l 

max 


(50) 


where 


k FS — 


k^i + (LjD p )y 


(51) 


This gives the relationship between the attractive force between two particles 
and the maximum surface field intensity. There is a limit to the magnitude 
that S max can reach without incurring some type of discharge or emission. 
From data such as those of Earhart (El) one would conclude that <f max 
cannot exceed some 100 to 200 V/micron before the particle would discharge 



TABLE VI 

Factors for Forces, Field Intensity, and Capacity between Equal-Sized, Equally Charged, Conducting Spheres' 


2 L S / D „ 

2 LID , 

1 + ( LJD „) 

k sA 

k S R 

k s g 

ksE 

k s <g 

kpg 

icr 7 

2 . 0 6 1 

1.000 

5.77 x 10 5 

0.615 

2.140 x 10 6 

1.000 

4.6647 

1.260 x lO’ 7 

10- 6 

2 . O5I 

1.000 

2.99 x 10 4 

0.615 

2.442 x 10 5 

1.000 

4.0891 

5.01 x 10" 7 

10- 5 

2 . O4I 

1.000 

4.05 x 10 3 

0.615 

2.845 x 10 4 

1.000 

3.5134 

5.00 x 10" 6 

10 -“ 

2 . O3I 

1.000 

5.79 x 10 2 

0.615 

3.400 x 10 3 

1.000 

2.9378 

5.01 x lO' 5 

10- 3 

2 . O2I 

1.000 

91.1 

0.615 

423 . 

1.000 

2.3626 

5.09 X lO" 4 

10- 2 

2.01 

1.005 

15.60 

0.618 

56,1 

1.003 

1.7896 

4.91 x 10" 3 

10- 1 

2.10 

1.050 

3.355 

0.643 

8.38 

1.034 

1.2237 

4.34 X 10~ 2 

0.5 

2.5 

1.250 

1.507 

0.757 

2.640 

1.173 

0.8892 

0.1381 

1 

3 

1.500 

1.213 

0.850 

1.770 

1.359 

0.7677 

0.1722 

2 

4 

2.000 

1.072 

0.935 

1.320 

1.770 

0.6705 

0.1538 

5 

7 

3.500 

1.012 

0.990 

1.080 

3.151 

0.5836 

0.0709 

8 

10 

5.000 

1.004 

0.996 

1.033 

4.604 

0.5556 

0.0378 

10 

12 

6.000 

1.003 

0.997 

1.026 

5.587 

0.5452 

0.02645 

100 

102 

51.000 

1.000 

1.000 

1.000 

50.51 

0.5049 

0.000384 


From Russell (R9). 
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Fig. 8. Attractive force between equal-sized, equally and oppositely charged spheres 
at various particle clearances. 

itself in air at 1 atm. Equation (50) is useful, therefore, for calculating the 
maximum possible force that can exist between two particles. The various 
factors grouped inside the last bracket are all a function of the clearance 
between particles and the value of this group, k FS , is given in Table VI. 
Since k FS is a maximum at a clearance of approximately one particle radius, 
the maximum force of attraction will exist at this spacing. As the particles 
move closer together, the particles will lose charge so that the force of 
attraction will actually become less. This assumes, of course, that there is 
actually a limiting field intensity, a fact which is indicated but has not been 
proven. 

Equation (50) may also be used to calculate the attractive force between 
equal sized, equally and oppositely charged spheres that are perfect insulators. 
For this condition, the value of k sA and of k sS is taken as unity if polarization 
possibilities are neglected, and if the charge is initially uniformly distributed. 
Any polarization will tend toward an approach to the conductor condition 
(which basically represents a condition of infinite polarization). Figure 8 
presents a plot of Eq. (50). Thus, if we assume that the maximum possible 
field intensity is 200 V/micron, the attractive force between equally but 
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oppositely charged, conducting spheres of 1 micron cannot exceed 0.0048 dyn. 
This force would occur when the particles are approximately \ micron apart 
provided that the particles had an initial charge at least corresponding to a 
value of S ps of 113 V/micron [as calculated from Eq. (47)]. Any charge in 
excess of this would be lost as the particles approach each other. As the 
particles continue to come closer together than \ micron, additional charge 
will be lost and the attractive force gradually drops to zero. 

For the same conditions with insulating spheres the maximum attractive 
force would be 0.028 dyn and would occur when the particles are touching. 
It might be argued that because of localized regions of very high charge an 
even higher attractive force could result. This, however, is unlikely because 
of the loss of charge that would take place at those spots. Where S ps is less 
than would correspond to the limiting critical field strength, however, such 
localized concentrations could result in higher attractive forces than would be 
calculated on the basis of the average £ ps value. It should be noted that with 
a perfect insulator there will also be a charge loss in the region of closest 
proximity. However, this will be confined to a relatively narrow region and 
will not be a major factor in the total charge left on the particles. The exact 
solution for this case would be quite complex. 

Table VII gives both the electrostatic field intensity and force due to a 
point charge corresponding to one electron in magnitude. It is apparent 
from these values that elementary point charges do not exert significant 
forces upon each other until the distance between them approaches molecular 
dimensions. At this point, field intensities become so high that electron 
transfer or tunneling must occur for the case where the point charges are of 
opposite sign. 

The presence of insulating films over the surface of charged conducting 
particles would tend to have relatively little effect, since the charges would 

TABLE VII 

Force and Field Intensity due to Single Electron Point 
Charge 


Radial Field intensity Force on another 

distance from due to point point charge of 

point charge charge 1 electron 

(A) (micron) (V/cm) (judyn) 


1 

0.0001 

1.437 

X 

10 9 

2.300 

X 

10 3 

10 

0.001 

1.437 

X 

10 7 

2.300 

X 

10 1 

100 

0.01 

1.437 

X 

10 5 

2.300 

X 

10" 

1,000 

0.1 

1.437 

X 

10 3 

2.300 

X 

10" 

10,000 

1 

1.437 

X 

10 1 

2.300 

X 

10" 
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still be free to migrate and leak off at the point of closest proximity. They 
might, however, alter somewhat the critical value of field intensity at which 
charges leak off, since this field intensity may be a function of the surface 
material. It should also be noted that, since no materials are perfect insulators, 
the above discussion is strictly relative to the time periods under considera¬ 
tion. The distinction between insulators and conductors can be drawn on the 
basis of the relaxation time of the material relative to the time period of 
importance as discussed previously. 

From the above discussion it is apparent that while charge can cause 
particles to come together, it cannot be a significant factor in determining 
the force of adhesion with conductive particles. With perfect insulators, very 
high charge levels (corresponding to $ ps > 100 V/micron) can result in an 
effective force of adhesion. 


2. Comparison with Other Forces 

In order to illustrate further the magnitude of electrostatic forces in 
relation to other forces, Fig. 9 has been prepared. In this figure the forces 
due to various mechanisms have been depicted as a function of particle size. 
The equations that are plotted in Fig. 9 are given in Table VIII. In order to 
simplify the presentation, the various forces have been presented for specific 
conditions as indicated in the legend. An attempt has been made to select 
conditions for which each particular mechanism will exert a maximum effect. 
In doing this within the bounds of realism, an oversimplification may have 
resulted. Flence before considering the figure in general some specific explana¬ 
tions are in order. 

Some of the forces presented are actually pseudo- or equivalent forces. 
This is true in the case of diffusion. The force formula given is the equivalent 
force that would give a particle the same migration velocity toward a surface 
as it experiences by diffusion when in close proximity to the surface. 

Of adhesive forces other than electrostatic, only van der Waal’s forces 
have been presented. Jordan (Jl), in discussing the work of Bradley (Bll) 
and others, indicates that the force of attraction between two particles of 
diameter D pl and D p2 may be expressed as 


F A = K w D pl D p2 /(D pl + D p2 ) (52) 

where K w is a constant of the material with a magnitude of the order of 
200 dyn/cm (0.2 N/meter). For equal-sized particles this becomes 


F a = (K w /2)D p 


( 53 ) 
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Fig. 9. Comparison of forces arising from various mechanisms. 

Corn (C7), however, points out that adsorbed films or gas layers may exist 
between the particles and may alter the nature of the adhesive force. Corn 
also indicates that various investigators have derived a value of the order 
of 4 no for K w when adsorbed liquid films are involved. Bradley (Bll) has 
derived an identical expression where o is defined as the surface energy of 
the solid. All of these forms yield values of K w of the same general magnitude. 
There are, however, other reports (Fuchs, F4, pp. 363, 373) that indicate 
adhesive forces between particles as much as two orders of magnitude smaller 
than these. 

For the forces due to fluid drag (shear and pressure), the maximum 
velocity has arbitrarily been taken as 1000 ft/sec. This, being essentially sonic 
velocity in room air, is the largest velocity that will normally be realized. 
However, for all velocities over say 300 ft/sec, compressibility effects will 
become significant. These are not allowed for in the equations and to this 
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TABLE VIII 


Equations Plotted in Fig. 9 


Type force 

Force equation 

Values of constants and physical 
properties used in preparing plot 

Earth’s gravity, 
less buoyancy of 
fluid 

F = ngp p D p 3 [(p„ - p)lp p \l 6 

p p = 1.00 gm/cm 3 (density of 
particle) 

p = 1.185 x 10“ 3 gm/cm 3 
(density of air) 

Inertial force to 
give acceleration a 

F== 7Tap p D p 3 l6 

p p = 1.00 gm/cm 3 
a — 10 4 gravities 

Total fluid drag 
at relative 
velocity u r 

F= TTpC D u r 2 D p 2 /8 
where 

C D = <P(N Re „) 

= 24/JV* e „) 

if N Rep < 0.2 
= 0.44 if N Rep > 1000 

Air at 25°C, 1 atm; 

p = 1.185 X 10 -3 gm/cm 3 
p, = 0.0184 cp 

u r = 1, 10, 100, and 1000 ft/sec 
(as indicated on curves) 

Diffusion— 
equivalent force 
causing migration 
over distance L m 

F=kT/L m 

T = 25°C (298°K) 

L m = lA (10 -8 cm) 

van der Waals 
attraction between 
two equal spheres 

F= hK w D p 

K w = 200 dyn/cm 

Electrostatic 
attraction of two 
equal spherical 
insulators 

TTe8£ 2 ps D p 2 

4 (L/D p ) 2 

8 = 1 

( L/D p ) = 1 (particles touching) 

= 0.1, 1, 10, 100 and 200 
V/micron 

(as indicated on curves) 


extent the values plotted are fictitious, although still of the correct order of 
magnitude. The curves for forces due to fluid drag are based on data for the 
drag coefficient, C D , for spherical particles in infinite space. In the case of 
electrostatic forces, as will be shown later, the limiting surface field strength 
that a particle can tolerate without ionizing air becomes less as size increases. 
This limit is indicated in a very approximate way by the dotted curve in 
Fig. 9, although such limits are not actually known with any precision. 

It will be noted in Fig. 9 that diffusional forces are small compared with 
other forces for particles smaller than 1000 microns in diameter. Inertial 
forces can become significant with particles larger than 10 microns. Fluid 
drag forces can be of the same general order as van der Waal's forces at these 
assumed high velocity levels. Electrostatic forces can be quite large but only 
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if relatively high charge levels are involved. If the charge level is as low as 
would correspond to S ps = 1 V/micron, the forces begin to approach those 
due to the earth’s gravitational field. 

It is doubtful whether electrostatic charges can play a significant role in 
adhesion under any but the most unusual circumstances. Although high force 
levels are shown in Fig. 9, there is considerable question as to whether the 
field intensities indicated can be maintained for any but the best insulators. 
It should also be remembered that conductive particles can show no adhesion 
due to electrostatic charges unless they are covered with an insulating film, 
and even then the magnitude of the possible force is very much less than 
with insulating particles. The subjects of switchgear arcing and electrical 
contact wear are areas that have not been covered in this study and may 
contain some relevant data. 

H. On Biological Processes 

No attempt has been made in this chapter to cover the biological or 
medical effects of electrostatic charges. It should be noted, however, that 
Gordieyeff* (G6) gives a bibliography of forty-two references, a large number 
of which deal with the medical aspects. 


IV. Electrostatic Dissemination Techniques 

Electrostatic forces can be utilized for the direct dissemination of materials, 
both liquid and solid. The dissemination of liquids by this means (electro¬ 
static atomization) has been studied extensively. The corresponding dis¬ 
semination of powders is referred to in the literature but has not received 
much attention in depth. Dissemination of bulk solids (i.e., consolidated 
solids as distinguished from powders) by electrostatic means has not been 
studied and, although it may be technically possible, is probably not feasible. 

A. Electrostatic Atomization of Liquids 

A large part of the interest in electrostatic atomization has arisen from 
two fields of interest: combustion of fuel oils and ion propulsion of rockets 
in space. Work in the latter area has been concerned primarily with atomiza¬ 
tion at high vacuum (10" 8 atm) as in the work of Cohen (C5), Hogan (HI2, 
13), Hendricks (H6), Schultz and Branson (S2), and Schultz and Wiech (S3). 
Graf (G8), Matthews and Mason (M3), Peskin and Raco (P3), Randall, 
Marshall, and Tschernitz (Rl), and Vonnegut and Neubauer (V4) have been 
concerned primarily with atomization at atmospheric pressure. Most of the 
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investigators claim that atomization is achieved as the result of Rayleigh 
instability. However, Schultz et ah claim that the ultimate atomization in 
their case is due to rupturing molecular bonds because of the high discharge 
rates involved in their tests. 

The data in general seem to indicate that greater conductivity (lower 
resistivity) is conducive to better atomization. The failure to obtain atomiza¬ 
tion at very high resistivities in the type of apparatus used by Drozin (D7) 
and Randall et ah (Rl) is easily understood since the source of high potential 
is far removed from the point of atomization. The poor atomization reported 
for liquids of very high conductivity is more difficult to understand, however. 
It also seems clear that there are two general modes of atomization. The first 
is due to Rayleigh instability, which results in filament formation and prob¬ 
ably is capable of giving only relatively coarse droplets (> 10 microns). The 
other involves a more intensive atomization for which the mechanism is not 
quite clear. It may involve an intensification of the Rayleigh instability, 
corona, or breaking of molecular bonds as suggested by Schultz and Branson 
(S2). It must be concluded that, while guidelines have been established for 
electrostatic atomization, the principles are far from understood and that at 
present there is no means of predicting atomization performance either from 
the standpoint of the charge level or the size attained by the drops formed. 
The rest of this section is devoted to a brief resume of the data generated 
during some of the more important investigations in this area. 

The high vacuum work was aimed primarily at achieving relatively high 
charge-to-mass ratios. These investigations were essentially all made with 
flow of liquid out of a fine charged capillary or over a fine charged needle. 
Hendricks (H6), atomizing Octoil (dioctyl phthalate), obtained a range of 
charge-to-mass ratios of 0.01 to 5 C/kg when producing particles in the size 
range of 0.2-10 microns. His highest charge levels corresponded to S ps values 
of about 80 V/micron. Schultz and Branson (S2) and Schultz and Wiech (S3) 
claim that by employing an electron emitter arranged to direct electrons at 
the liquid to be atomized, much higher charge levels (up to 300 C/kg) could 
be achieved. They report results of 80-130 C/kg when atomizing dioctyl 
phthalate from a fine positively charged needle point. They attribute the 
improvement over Hendricks’ results to high charge emission rates made 
possible by the electron emitter, thereby overcoming liquid tensile strength 
rather than simply surface tension. Cohen (C5) obtained charge levels of 
100-1000 C/kg when using an atomizing needle of either positive or negative 
polarity in the absence of any emitter. Cohen, however, did use a liquid 
(glycerol) doped with antimony chloride or sulfuric acid to make it conductive. 
Schultz and Branson (S2) report that for resistivities greater than 10 14 Q * cm, 
the liquid cannot be charged except by either liquid breakdown or corona. 
Neither Schultz and Branson nor Cohen report any data on particle size. 
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Hogan (H12) obtained R c values of 0.01-10 C/kg. With dibutyl phthalate, 
however, he reports a mode of atomization in which no liquid jets could be 
seen with a microscope and for which R c values of 10 5 C/kg were measured, 
corresponding to singly ionized dimer molecules. No particles could be 
formed with silicone pump oil, whereas charges of 200-400 C/kg were 
obtained with suspensions of either Cab-0-Sil 4a or carbon in glycerine. 

In summary, the data on high vacuum atomization have in general dealt 
with charge levels much higher than those of interest in dissemination and 
have given little attention to the question of particle size. Hogan (H12) does 
give a good summary of theoretical considerations involved. 

Peskin and Raco (P3) have given a theoretical analysis of both ultrasonic 
and electrostatic atomization from the point of view of liquid instability. 
They conclude that atomization with low frequency ac will require about 
twice the field strength as dc but that, by going to high frequency, lower 
fields are possible with conducting liquids. The value for the critical field for 
atomization given by these authors for a dc field is, however, smaller than 
that which would be calculated from Eq. (39) by a factor of (1/32) 1/2 . This 
presumably reflects the simplified one-dimensional model used in their 
derivation. 

Matthews and Mason (M3) consider the rupture of a large droplet due 
to aerodynamic forces both with and without the presence of an external 
electrostatic field. With the field present, the charge on the drop fragments 
was of the order of that which might be expected by induction from the field, 
and the charge also reversed sign with the field. The particle charge attained 
with the external field was also some two orders of magnitude larger than 
any charge attained in the absence of the field. This study was aimed primarily 
at explaining charge buildup in thunderstorms. 

Graf (G8) measured the pressure buildup due to electrostatic charging of 
stationary drops. He found this pressure to be some 30-60% of the value 
given by Eq. (4), independent of polarity of charge or dielectric constant of 
the liquid. This factor, which he terms a charging factor, was a function of 
both geometry and liquid properties. It may also reflect polarization effects 
that are neglected in Eq. (4). 

Graf (G8) also briefly investigated atomization. Charge-to-mass ratios 
of the order of 10" 3 C/kg were obtained with oils having resistivities of 10 8 - 
10 12 Q • cm. No particle size data are given but he reports better atomization 
and higher charging rates as the resistivity of the liquid becomes less. 

Drozin (D7) presents an analysis corresponding to Eq. (4) for a polariz¬ 
able liquid in the absence of any surface charge. However, he expresses the 
electrostatic pressure in terms of the electric field inside the drop rather than 


4a A very fine silica marketed by Cabot Corp. 
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in terms of that external to the drop. As a result, his theoretical analysis 
shows an apparent marked effect of dielectric constant of the liquid. If the 
conversion is made, dielectric constant is found to have relatively little effect 
as reported by most investigators. Using a buret-type atomizer, Drozin 
reports four stages of atomization as the field intensity is increased: (1) for¬ 
mation of large uniform drops with low or no fields; (2) formation of liquid 
filaments; (3) eruption of fine drops from the filaments; and (4) gradual 
shortening of filaments and replacement of them by a cloud of fine droplets. 
When a complete cloud of fine droplets only formed, it exhibited higher order 
Tyndall spectra, implying a relatively uniform size distribution. He also 
reports that, with a grounded electrode beneath his atomizer, voltages in 
excess of the condition resulting in a complete cloud caused a traversal of 
the above atomization stages in reverse order. Drozin claims that the fine 
particle cloud condition can only be achieved when the liquid is positively 
charged. It is not clear whether this represents his own observation or is a 
conclusion based on the data of Vonnegut and Neubauer (V4) who reported 
that a “ smoke ” could be obtained only with a positively charged liquid and 
only with liquids of very low conductivity (distilled water, lubrication oil, 
alcohol). Drozin also concludes that atomization is possible only if resis¬ 
tivity is between 10 5 and 10 13 Q • cm; liquids that are more conductive (such 
as salts and acids) or less conductive (such as xylene or carbon tetrachloride) 
could not be dispersed even with potentials up to 30 kV. Drozin also con¬ 
cludes that liquids with dipole moments smaller than 10" 18 dyn 1/2 - cm 2 
cannot be dispersed, in agreement with the conclusions of Straubel (SI2). 

Vonnegut, Moffett, Sliney, and Doyle (V3) in general confirm the modes 
of atomization reported by Drozin. In their case the drops were mostly 
greater than 100 microns except for the smoke or fine-particle cloud condi¬ 
tion. They also report that the smoke was obtainable only with positive polarity 
at high voltage and that very conductive water (resistivity less than 10 3 Q • cm) 
or poorly conductive materials (resistivity greater than 10 13 Q. * cm) would 
not form smoke. They also report that drop formation either becomes erratic 
or ceases when corona ensues. 

Randall et al (Rl) examined some 60 liquids with a buret-type atomizer 
and positive dc voltage. They also concluded that atomization could not be 
achieved if the resistivity was greater than 10 10 Q • cm and became much 
poorer if the resistivity was less than 10 5 Q * cm. Increased viscosity resulted 
in a coarser spray, especially for viscosities above 20 cp. Measurements of 
size distribution are reported which indicate mass median diameters larger 
than 20 microns. An optimum voltage of 25^40 kV was found at which the 
mass median diameter was a minimum. They also ran limited tests with glass 
capillary atomizers (<200 microns i.d.) for which they estimated droplet 
sizes were less than 10 microns when atomization was achieved. They give a 
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semiquantitative description of the type of atomization achieved with each 
type of atomizer at various voltages and with various liquids. 

Shaffer (S5) also made some exploratory evaluations of the electrostatic 
atomization of dibutyl phthalate using a camel's hair brush for the atomizing 
nozzle. On a count basis 73% of the particles were smaller than 10 microns 
and the largest particle obtained was 40 microns. The energy input corre¬ 
sponded to 0.5 cal/g liquid atomized (0.00026 kWh/lb) and the charge level 
on the particles as atomized corresponded to a value of 8 ps of the order of 
3-5 V/micron. Current and flow rate measurements reported by Yonnegut 
and Neubauer (V4) would correspond to an energy input of 0.1 kWh/lb. 

As a criterion for determining the size droplets formed by electrostatic 
atomization, Vonnegut and Neubauer (V4) postulated that the total energy 
of the system should be a minimum at equilibrium. On this basis they con¬ 
cluded that the most probable droplet diameter would be given by 

D p = (72£7 2 <7/i2 2 ) 1/3 (54) 

where V is the volume of liquid, having a surface tension cr and charge 1, 
that is atomized. This can also be written in terms of charge-to-mass ratio, R c , 

D p = (' nzalp p 2 R c 2 ) lli (55) 

Both Hogan (H12) and Ryce (R10) have discussed this relationship. However, 
it has not been recognized that if Eq. (55) applies to an atomized jet it must 
also apply to a droplet that is evaporating. When this drop reaches a critical 
charge level, as given by Eq. (39), it will atomize as reported by Doyle et al. 
(D6). This critical charge level, expressing Eq. (39) in terms of charge-to-mass 
ratio, is given by 

R c 2 = [288e olp p 2 Dlo] (56) 

where D p0 is the diameter of the evaporating drop before it atomizes. Since, 
if the whole drop atomizes, the final charge-to-mass ratio remains the same, 
combining Eqs. (55) and (56) will give the final drop diameter, D p , in terms 
of the initial drop diameter, or 

D p = D p e/4 1 / 3 (57) 

This is equivalent to dividing the original particle into four equal pieces. 
From Doyle’s data, this was obviously not the case since a few very small 
droplets of higher charge-to-mass ratio were ejected from the large droplet. 
One may conclude, therefore, that an atomizing system is not an equilibrium 
system and that the approach leading to Eq. (55) is not valid. 

Dunskii and Kitaev (D9) describe a pneumatic atomizing nozzle in which 
there is an added provision for electrostatic charging with a potential of 
1000 V and a power of 0.1 W. They report that the drops formed in the 
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presence of the electrostatic field are two to four times smaller than with 
pneumatic dissemination alone and that more aerosol is deposited on the 
ceiling of the testing chamber when the field is applied. They claim that the 
charging is by induction rather than by corona, 

B. Electrostatic Dissemination of Powders 

Electrostatics plays two separate and distinct roles in the dissemination 
of powders. It can be employed as an actual disseminating force to cause 
outward migration of the individual particles or it can be used simply as an 
adjunct to some other technique of dissemination, such as pneumatic dis¬ 
semination. The former is entirely responsible for the transfer of powders 
in certain forms of electrostatic printing. Some of the problems and con¬ 
siderations involved have been discussed by Schultz and Wiech (S3) in con¬ 
nection with the use of powders for ion propulsion. A number of articles 
refer to dissemination of charged powders, but in most of these cases the 
dissemination process is pneumatic and electrostatic fields are employed 
primarily to obtain unipolar aerosols rather than for dissemination purposes. 
It is, however, likely that the electrostatic field will also influence the dis¬ 
semination process, whether this was the intent or not. Gohlich (G5) discusses 
systems for pneumatic dissemination with simultaneous charging. Kitaev (K4) 
describes corona charging of powders and concludes from the usual charging 
equations that all particles will be almost fully charged within 0.01 sec. The 
nozzle arrangement described by Dunskii and Kitaev (D9) would appear to 
be equally applicable for dissemination of powders. 


V. Methods of Charging or Discharging Particles 
A. Maximum Stable Charge 

Any particle in the atmosphere will gain or lose charge because of chance 
encounters with ions, photons, etc., but these events are usually so rare that 
charge changes due to them are negligible over periods of, at least, minutes 
to hours. Schonland (SI), for example, reports data for the atmosphere that 
would correspond to relaxation times, t R , ranging from 10 min to 2 hr near 
ground level. He also indicates that small ion concentrations may range 
from 100 to 1000 ions/cm 3 while large ion concentrations may range from 
200 to 80,000 ions/cm 3 . These are all small compared to particulate concen¬ 
trations of concern in agent dissemination. Chalmers (Cl) reports fine weather 
relaxation times of 5-30 min. However, when the charge of the particle 
becomes sufficiently high it will either emit electrons or cause electrons to be 
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generated in the surrounding atmosphere, which will serve to neutralize the 
charge on the particle. Actually, this can happen to a minor extent at all 
levels of charging but does not become significant until the charge level 
reaches a specific critical level. This is identical to the level at which a wire 
or other charged body will go into corona. 

In the normal atmosphere if a large body is charged to such an extent 
that the outside surface field intensity (or exceeds approximately 30,000 
V/cm (3 V/micron), corona will ensue. This critical field intensity may be 
somewhat higher with a negatively charged body than with a positively 
charged one. The critical field intensity is also known as the electrical break¬ 
down strength of air. It is strictly valid only for a uniform field, as between 
flat plates, 5 but is still approximately true for large bodies outside of which 
the field intensity does not drop off too rapidly. 

In order to achieve breakdown, electrons (either from the air or from the 
body) must be accelerated to a sufficient velocity to ionize the air and breed 
more electrons by any one of several processes. In an actual gas, however, 
some of the kinetic energy of the electrons is lost in collisions with air mole¬ 
cules without resulting in ionization. This combined effect has been expressed 
in terms of the Townsend ionization coefficient. As a body becomes smaller, 
its curvature increases and the electric field intensity drops off more rapidly 
with distance from the surface; consequently, to accelerate electrons a given 
amount, the body surface field intensity must be higher than for a flat surface. 
Actually, because of increased attenuation resulting from the increased 
distance that an electron must travel through air to achieve a given accelera¬ 
tion, the required surface intensity must increase even faster. 

No attempts to calculate those limiting surface field intensities or gradients 
for small particles have been reported in the literature. However, point-to- 
plane corona has been studied in detail, Loeb (L8) has proposed a method 
for calculating positive corona threshold limits making use of established 
values of the Townsend first ionization coefficient. Loeb’s threshold formula is 

exp f a dx — 1 (58) 

where a is the first Townsend ionization coefficient; the integral 

f * 2 

I otdx 

is known as the Townsend integral. The exponential term represents the 
number of electrons (or ions) created in an avalanche originated by a single 

5 Actually, with flat plates, the first sign of breakdown is a spark and there is no initial 
corona. 
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electron. The term ( represents the fraction of these electrons that produce a 
photon capable of ionizing the gas photoelectrically, and £ is the chance 
that such a photon will give rise to a photoelectron that can be the source of 
a new avalanche. The term a is a function of both pressure and field intensity. 

Loeb et ah (LI 3) have shown that for the positive corona threshold the 
Townsend integral is essentially a constant having an average value of 10.2. 
Using this value for the integral, the field distribution relationships around a 
sphere, and the values of the ionization coefficient a (summary given by 
Loeb, L10, p. 676), the positive corona threshold limits for spherical particles 
have been calculated and are presented in Fig. 6 in terms of a normalized 
specific particle surface gradient, and in Fig. 7 in terms of charge-to-mass 
ratio. It will be noted that for p = 1 atm, the threshold charge limit on all 
sizes of liquid particles is determined by Rayleigh instability rather than by 
corona considerations, although for water (a = 72 dyn/cm) the difference is 
small for particles of the order of 100 microns. 

Negative corona is considerably more complicated and depends more on 
the emission properties of the discharge surface and of the gas. White (W4) 
gives a detailed discussion of corona mechanisms. In general, the negative 
corona threshold is higher than the positive. 

In Fig. 6, it will be noted that at p = 1 atm for particle sizes less than 10 
microns, the corona threshold rises rapidly. There is strong evidence that 
there is an upper limit to the specific particle surface gradient, £ ps , of the 
order of 200 V/micron, and that the rapidly increasing values calculated for 
the corona threshold cannot be realized. Earhart (El), for example, finds 
when surfaces are closer together than 2 microns that discharge occurs at a 
value of $ ps of 162 V/micron at atmospheric pressure and at even lower 
gradients for reduced pressures. Also, if the gradient reaches the order of 10 4 
V/micron, field emission from the surface can occur. There are also some 
other reports (Germer, G3; Kisliuk, K3) that suggest some type of field 
emission at gradients of the order of 100 V/micron. It has been suggested 
that these low apparent emission gradients may be the result of local intensi¬ 
fication due to surface roughness (Robertson, Viney, and Warrington, R6). 

It should be noted that the calculation of positive corona thresholds for 
spheres has been based on values of the Townsend integral for point-to- 
plane configurations. Thus, geometry may play an additional role that has 
not been allowed for. Also with particles smaller than 10 microns, the mean 
free path of electrons, molecules, and ions is being approached. If it is 
assumed that an electron or an ion can be accelerated without attenuation 
over a distance of its mean free path, then the specific particle surface gradient 
required to raise the energy level of the electron or ion to a potential of E i volts 
can be shown by a simple integration to be given by 

= (E,/AJ[1 + (2 lJD p y\ » {EJk J 


(59) 
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where l m is the mean free path. Since ionizing potentials are of the order 
of 20 V and mean free paths are of the order of 0.1 micron, it is clear that 
specific surface gradients of 200 V/micron can accelerate either electrons or 
ions to ionizing energy levels. It would seem, therefore, that specific surface 
gradients cannot exceed a value given by Eq. (59), if E { is defined as the 
ionization potential for the suspending gas and X m is taken as the mean free 
path of a gas molecule (or positive ion). It is interesting to note that this level 
checks the order of gradient reported by Earhart for surfaces that were very 
close together. This subject, however, requires further study. 

B. Contact Charging 

When two bodies are in contact with one another, a potential difference 
may be set up between the two bodies. This is known as a contact potential 
and is most commonly applied to conductors or semiconductors. If the bodies 
are physically separated, some of the charge segregation resulting from this 
contact potential will remain, causing the bodies to be oppositely charged; 
the rest may leak back during the separating process. The magnitude of this 
charge can be estimated from a knowledge of the contact potential and the 
capacitance of the system before separation. 

As previously discussed, for two equally sized, equally and oppositely 
charged conducting spheres, the capacitance is given by 

V = AJE = 2nk# tD p (48) 

where k s % is a function of the ratio (. LJD p ); L s is the clearance between the 
spheres; is the charge on each sphere; and E is the potential difference 
between the spheres. Values of k s% are given in Table VI if L s is known. From 
the definition of $ ps [Eq. (1)], Eq. (48) becomes 

i ps = 2 k«E!D p (60) 

It is also easily shown that the corresponding equations for a charged 
conducting sphere near a grounded conducting plane are given by 

% = £ p /E=4nk s vED p (61) 

$ ps = 4k s <# Ej D p (62) 

where £ p is the charge on the sphere; E is the potential difference between 
the sphere and the plane; and S ps is the average gradient corresponding to 
£ p on the sphere. The value of k s % may also be obtained from Table VI if L s 
is defined as twice the distance between the plane and the edge of the sphere. 
Thus, if E is the contact potential acting over a distance [L s for two equal 
spheres, (LJ2) for the sphere and plane], the charge can be calculated from 
the above equations if the separation distance is known. Actually, the effective 
distance over which E acts will be of the order of 1 A (or a portion of one 
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molecular diameter). Fortunately, an examination of Table YI shows that 
k s <# is very insensitive to L s and that this distance need not be known accurately 
to obtain k s<€ . Also some charge may leak back if $ max between the bodies 
is greater than the breakdown strength. For larger clearances it was pre¬ 
viously indicated that this breakdown would probably correspond to a value 
of <^ max of some 200 V/micron. However, for the extremely small clearances 
here involved, it is more likely that back discharge either will not occur until 
the potential difference between the bodies exceeds the ionization potential 
or will proceed by the electron tunneling effect. 

As the bodies are separated the potential difference will increase. How¬ 
ever, the potential difference at any separation will be directly proportional 
to the respective value of k s <#. Since this will not vary markedly, the potential 
difference between the bodies will not increase very much as they are separ¬ 
ated. Therefore, since contact potentials are normally considerably below 
ionization potentials, it is unlikely that back discharge will actually occur 
owing to air ionization. However, from Table VI and Eq. (47) it is also 
apparent that the maximum surface gradient may be of the order of 10 8 
V/cm (if £ = 1 V and separation distance is lA), which should be enough to 
cause field emission and discharge by this means. 

Thus the amount of back discharge cannot be readily calculated. It would 
seem unlikely, however, that the potential difference would drop below the 
contact potential as the bodies are separated. This would mean that the 
actual charge leaking off would be only a small fraction of that originally 
present. 

Assuming a value of k s % of 2 (which is a likely value from Table VI), it 
can be shown from Eq. (60) that S ps will be of the order of 4 V/micron for a 
1-micron-diam particle if the contact potential is 1 V. It is thus apparent 
that substantial charges can be generated by simple contact potential. 

The contact potential represents the difference between the work function 
of the two metals. Hence, the particle charge will be positive on that body 
which has the lower work function and negative on that body having the 
higher work function. 

Cho (C2) has calculated charging by contact potentials essentially on the 
basis described above, assuming a contact potential of 1 V for the case of an 
aluminum particle in contact with a steel plate. Arabadzhi (Al) reports a 
contact potential between ice and water of 0.15 V, although the abstract for 
another article by Arabadzhi (A2) reports this value as 1.5 V. 

C. Induction or Field Charging 

It is possible to charge a particle by placing it in an externally generated 
field. This external field will cause either electrons or ions that are present 
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in the surrounding space or on a contacting surface to flow to the body. 
This is sometimes known as induction charging. 

Two specific cases will be considered. The first involves a conductive 
particle resting on a charged plate. It has been shown (Cho, C2) that the 
average field at the surface of a spherical particle resting on a flat plate is 
(n 2 /6) times that at the surface of the charging plate. Thus, assuming that 
the particle is maintained on the surface long enough to reach this equilibrium 
condition, the particle will reach a charge level given by 


£ ps = (n 2 /W 0 = 1 * 65 ^ 


(63) 


where S Q is the field at the surface of the base plate (or the field between 
two plates with the particle resting on one of them). In actual fact the repulsive 
forces may cause the particle to leave the plate surface before the equilibrium 
charge is reached. Cho (C2) has investigated this method of charging experi¬ 
mentally using aluminum, iron, nickel, molybdenum, and magnetite particles, 
and essentially confirms Eq. (63) for particles in the size range of 0.3-15 
microns at field strengths, <f 0 , up to 2.5 V/micron. In the case of aluminum 
he found that when the plate was positive the aluminum particles received a 
slightly higher charge than when the plate was negative. The difference was 
of the order predictable for charging due to contact potential. He also found 
that conductor and semiconductor particles would continuously migrate back 
and forth between the charging plates. Insulator particles did not react 
immediately to the application of a potential across the charging plates, but 
some did hop off the charging plate after a period of time. This was attributed 
to the small but finite conductivity of such particles. Such particles did not 
migrate back and forth between the plates. Nonconductors can only be 
polarized in such a field. Fields around polarized particles are usually an 
order of magnitude lower than those resulting from charged particles. 

The second case of field charging that is considered is that of charging a 
particle located in a uniform field in the presence of a net concentration of 
unipolar gas ions. This case has been extensively treated by Deutsch (D4), 
Rohmann (R7), Ladenburg (LI), Pauthenier and Moreau-Hanot (PI) 
Mierdel (M5), and Cochet (C3). A good summary is given by White (W4). 
Under these circumstances, a particle will always reach a specific limiting 
charge, but the time required to reach this limit will depend on the concen¬ 
tration and mobility of the gas ions. The most general relationship is that 
given by Cochet (C3) 


W„J i 


(ft»iQ/4e)f 
+ (A«iQ/ 4 e) f 
2 


PSmax 


■([ i + 3)i a + r- - 2 — \* l ±]\ 

U UvJ Ll +(22 i /D.)J k + 2«5jj 


(64) 

(65) 
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where <^ pSmax is the maximum or equilibrium charge level; $ ps is the charge 
level attained after time t ; is the ion mobility; is the unipolar concen¬ 
tration; X t is the mean free path of the ions; S and S p are the dielectric con¬ 
stants for the fluid and the particle, respectively; and S 0 is the strength of 
the external field. The term in braces is a function of the particle and gas 
properties. Neglecting the effect of ion mean free path (which will normally 
be significant only for particles smaller than 1 micron), this term will vary 
from 1 for particles that are perfect insulators to 3 for particles that are 
conductors. 

The maximum charge is reached in a time that depends on the ion con¬ 
centration as indicated by Eq. (64). Ion concentrations of the order of 10 8 
ions/cm 3 are readily achieved by means of corona from wires and ion mobili¬ 
ties are of the order of 2 (centimeters/second)/(volts/centimeter). For these 
conditions 4 e/(/?,«/Q) is 0.011 sec. It is apparent from Eq. (64), therefore, 
that the particle will essentially acquire its maximum charge in less than a 
second. Since S Q can be of the order of 3 V/micron (breakdown strength of 
air), it is also apparent from Eq. (65), that particle charges corresponding to 
the range of 3-10 V/micron can be achieved by this means. 

With ambipolar ions, a net charging can still take place. The charging 
rate for this case can be approximated by letting the term (/?,«*) in Eq. (64) be 
equal to the difference between the corresponding terms for ions of each 
charge. Actually significant ambipolar ion concentrations may be short-lived 
because of ion recombination. 

The following relationship (White, W4) gives the electric field for con¬ 
centric-cylinder electrodes, as modified by the space-charge effect of the 
corona ions, 


* = LUJlmPd + (S c DJD ) 2 ] 1/2 (66) 

where $ is the field intensity at diameter D\ $ c is the field intensity at the 
wire of diameter D c {S c is the critical field intensity required for corona); 
j L is the corona current per unit of wire length; is the ion mobility. For 
large currents and for the points near the outer cylinder, the last term may 
become small and the field is closely approximated by 

£=[j L /(2nzp i )r 2 (67) 

Ion density is related to current by the basic relationship 

j A = HtQPt# (68) 

where j A is the current per unit area parallel to the electrodes (i.e., normal 
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to the electrical force field), is the ion concentration, is the ionic mobility, 
and £ is the field intensity. 

In order to attain a desired value of field strength, the current must be 
adjusted to the level indicated by rearranging Eq. (66) as follows: 

h = [1 - (£ c DJ£ Z)) 2 ] (69) 

where the term in brackets may often be neglected. This adjustment of current 
is actually achieved by regulating the voltage across the cylinders. Thus for 
£ = lV/micron, = 2 (centimeters/second)/(volt/centimeter), neglecting the 
term in brackets, a current flow of 3.4 mamp/ft of wire length is required. 

The corresponding ion density is obtained by combining Eq. (68) with 
Eq. (69) to yield 

hi = {2e£/(QD)}[\ - (£ c D C /£D) 2 ] (70) 

Thus for £ = 1 volt/micron, D = 2in., and neglecting the terms in brackets, 
Hi - 2.18 x 10 9 ions/cm 3 . 

Loeb (L10) gives a detailed discussion of the factors influencing ion 
mobility. The electrostatic mobility of a charged particle, jS pe , is defined as 
the ratio of the terminal migration velocity to the field strength applied. The 
electrical accelerating force acting on a particle is given by 

F e = £ p £ (71) 

where l p is the charge on the particle and £ is the external field. The force 

resisting motion is given by Stokes’ law of resistance. 

F r = 'infiuDpjkc (72) 

When F e and F r are equal and opposite, the particle has attained its terminal 
migration velocity, u m , 

u m = k c ££ p l(?>nnD p ) = k c z5D p £ ps £l3}i (73) 

or 

he = "j* = h 5p/(3 nnD p ) = k c s SD p SJln (74) 6 

Cochet (C3), in measurements with particles in the size range of 0.1-1 micron, 
found a minimum mobility and migration velocity at a particle diameter of 
micron. The following are some of his results for the minimum velocity 

6 It will be noted by comparison with Eq. (1) that electrostatic mobility, f3 pe , is directly 
proportional to specific particle potential, E ps . 
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and mobility at an ion concentration of 8 x 10 7 ions/cm 3 with equal charging 
and migrating field strengths: 


Field 

Charging 

Migration 

Electrostatic 

strength 

time 

velocity 

particle mobility 

(V/micron) 

(sec) 

(cm/sec) 

(cm/sec)/(V/micron) 

0.15 

0.04 

0.5 

3 

0.50 

o.ot 

3 

6 


Recently, Foster (FI) reported measurements on corona charging of 
wood-smoke particles of 0.1-0.3 micron, and obtained charge levels of 5-32 
electrons per particle (# ps = 1-5 V/micron). He concludes that these levels 
agreed well with predictions assuming that field induction and diffusion 
charging effects are additive. Migration velocities varied from 0.5 to 23 
cm/sec depending on the voltage and current applied. Over the limited range 
covered there was no significant effect of particle size on migration rate. 


D. Diffusion Charging 


Whenever ions are present in a gas, they will diffuse and deposit on any 
surface (bounding wall or suspended particle). As the surface becomes charged 
subsequent deposition of ions is retarded. The rate of charging will, therefore, 
drop off very rapidly as charging progresses. There is, however, no real 
limiting charge for this mechanism since, on a probability basis, there will 
always be an ion of sufficient energy to penetrate through the electrostatic 
field set up around the surface by previously deposited particles. 

Arendt and Kallman (A3) have derived the following relationship for 
the rate of charging a particle by diffusion of unipolar ions: 


d2, 

dt 



TiEUik.Dp 2 ^ _ \nD p 2 k s u i Qn i '\ _ f Q£ p \ 
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This equation may also be written in the form 


dfps 


dt 



8QD,*„ ) 

2k T I 


(75) 


(76) 


where l p is the charge on the particle after time t; S' pK , the corresponding 
specific particle surface gradient; u it the rms ionic velocity [given by 
(3kT/m i ) 1/2 or (3R7’/i /,) 1/2 ]; k s , a sticking coefficient (found to be essentially 
unity); /?,, the ionic mobility; and the ionic concentration in the bulk of 
of the gas phase. White (W4) has given a simpler derivation that yields a 
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result identical with Eq. (75) except that the term in square brackets on the 
left is omitted. For small particles, this term will normally not have any 
significant effect compared with that exerted by the exponential term. White’s 
equation also lends itself to ready integration to yield 




2nED p kT 


In 
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(77) 


or 
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(78) 


Table IX demonstrates the order of magnitude of the various terms, for 
D p = 1 micron, 5=1, n { = 10 8 ions/cm 3 , M t = 32 gm/gm mole, = 2 
(centimeters/second)/(volt/centimeter), k s = 1. 


TABLE IX 

Illustrative Values of Terms of Eqs. 75-78 



T = 298°K 

T - 1000°K 

Ui , m/sec 

482.0 

883.0 

(UikJAhpi), V/micron 

0.602 

1.103 

(2k T/SQDpX V/micron 

0.0513 

0.1723 

(k s u t tiiQ/Ae&X (V)/(micron)(sec) 

218.0 

399.0 

(2Tre/) p k77Q), C 

1.430 x 10- 18 

4.80 X 10" 18 

(SekT/k s Hi rii Q 2 Z> P ), sec 

2.355 X 10- 4 

4.32 x 10“ 4 


The above equations do not allow for kinetics associated with the mean 
free path discontinuity at the particle surface. A correction for this would 
increase the charging rate. As a first approximation, this can be allowed for 
by replacing D p in Eqs. (75) and (77) with the product D p [\ + (2A f /Z) p )]. 
This is based on the reasoning that ions migrating to within a mean free path 
of the particle surface will be deposited and that the ion concentration will 
drop effectively to zero within a mean free path of the surface. 
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1 + 


2A, 

D„ 


(79) 


Thus, since S ps will be relatively insensitive to the items inside the logarithmic 
term, the main effect of this correction is to increase the value of S ps or J2 P 
by the factor [1 + (2A*/Z> p )]. 

The values in Table X have been calculated from Eqs. (77) and (78) for 
/i. = 10 s ions/cm 3 , u t = 482 meters/sec (corresponding to 0 2 ), T— 25°C, 
<5 = 1, neglecting the mean free path correction factor. 
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TABLE X 

Illustrative Values of Diffusional Charging of Particles 


Particle diameter 
(microns) 

Charging time 
(sec) 

Particle charge 

^P <*ps 

(electrons) (V/micron) 

0.1 

0.01 

1.5 

0.85 


0.1 

3.4 

1.94 


1 

5.4 

3.11 


10 

7.5 

4.29 

1.0 

0.01 

33.6 

0.19 


0.1 

54.0 

0.31 


1 

74.6 

0.43 


10 

95.1 

0.55 

10.0 

0,01 

540 

0.031 


0.1 

746 

0.043 


1 

951 

0.055 


10 

1157 

0.067 


It is apparent from the values of S ps that diffusional charging is primarily 
effective with very small particles, as might be expected. With particles as 
large as 10 microns the charge level produced in any reasonable time is small 
compared with what can be achieved with corona charging. For the 0.1- 
micron particle, however, the charge level attained by diffusion is of the 
same order and may be larger if the mean free path correction is allowed for. 

Whitby (W3) has developed an ac ion generator specifically for the purpose 
of producing electrically neutral particles. This neutralization is accomplished 
by adding a gas stream loaded with gaseous ions (both positive and negative) 
to the aerosol to be neutralized. Although exact neutralization cannot be 
assured by this means, actual neutralization to within a few electrons has 
been achieved. This is probably due to the fact that diffusion charging (dis¬ 
charging in this case) is not sensitive to ion concentrations that appear only 
in the logarithmic terms of Eq. (77). 


E. Thermionic Emission Charging 

All materials emit electrons to an extent dependent on their temperature. 
Such emission of electrons will cause a body to become positively charged. 
This effect does not usually become significant until the temperature levels 
are above the normal incandescent levels, which are to some extent related 
to this phenomenon. Einbinder (E3) reports that the ionization levels are 
given by 
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(a) for njn p very small: 



(b) for njn p ^ 3: 



where n e is the concentration of electrons in space; n p , the concentration of 
particles in space; and cp, the work function for the particle surface. These 
equations assume that equilibrium is reached between the spatial concen¬ 
tration of electrons and particles. Thus the number of electrons emitted must 
be equal to the number of positive charges left on the particles. Since 
<S ps - (n e /n p )(Q/nz8D p 2 ), the above can be written in terms of the average 
value of S ps to give 

(a) for n e /n p very small: 



(b) for njn p S 3: 

2kT ( nz 5D p 2 £ \ = 

SQD P \ Q ) 

Q 2k T 

2tie Jd} + 8QD p " 



(83) 


Table XI demonstrates the order of magnitude of the various terms, for 
D p = 1 micron, 3 = 1, q> = 5 eV. 


TABLE XI 

Illustrative Values of Terms of Eqs. 80-83 



r=iooo°K 

2000°K 

T = 3000°K 

(277-m e kT/h' ( ) 3/2 , particles/cm 3 

0.760 x 10 20 

2.15 x to 20 

3.95 x 10 20 

277e/) p k77Q 2 , dimensionless 

30.0 

60.0 

90.0 

9 /kr, dimensionless 

58.0 

29.0 

19.32 

e -vihT, dimensionless 

6.47 X 10- 26 

2.54 x 10- 13 

4.03 x 10" 9 

QI7tb8D p 2 , V/micron 

0.00576 

0.00576 

0.00576 

2kTj8QD p , V/micron 

0.1723 

0.346 

0.518 
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For n p in the range of 10 6 -10 10 particles/cm 3 , it is evident from these 
values that significant charging does not occur by thermal emission until 
temperatures are of the order of 2000°K or over. 

The above indicates the charge attained in the presence of the electrons, 
the cloud being electrically neutral. As the cloud is cooled or diluted, the 
cloud will remain electrically neutral but some recombination of electrons 
with the charged particle may occur to reduce the net level of particle charge. 
In the above, ionization of the gas phase has been neglected. Such gas 
ionization would also occur either at high temperature or at high particle 
charge level and could exert a significant interactive effect on the process of 
particle charging by thermal means. 

At elevated temperatures emission of ions can also occur but this normally 
requires an even higher temperature level than does electron emission. 

F. Charging by Interface Alteration 

When materials are brought into contact and then separated or when a 
bulk material is separated by formation of a new surface, the two separated 
bodies usually acquire an equal and opposite charge. The sign and magnitude 
of charge on each body depend on a large number of factors, which are 
(1) the physical and chemical composition of each body; (2) the nature of 
contacting (gentle contact, impact, or rubbing); (3) the method and speed of 
separation after contact; (4) the presence of contaminants or ions on the 
surface; (5) the nature of the atmosphere surrounding the bodies; and (6) the 
presence of any external electrostatic fields. The final charge observed is the 
residual charge left after separation. The charge present just before separation 
takes place could be higher because of the back discharge that may occur 
during the separation process. The final observed charge separation is, there¬ 
fore, the net result of both a charging process and a discharging process. 

All available investigations in this area are essentially reviewed in recent 
survey articles by Harper (H3, H4), Loeb (L9, Lll, L12), and Montgomery 
(M7). This area has been the subject of investigations since ancient times. 
Despite the extensive investigations made in this area, considerable uncer¬ 
tainty and confusion remain as to the details of the charging processes 
involved. Table XII gives a summary of the types of systems that are subject 
to charging as the result of interface alterations and the possible mechanisms 
that may be responsible for the charging process. The subsequent sections 
discuss the magnitude and mechanism of charging in each of these cases in 
more detail. 

1. Liquid Breakup 

The charge separation resulting from the alteration of a liquid-gas inter¬ 
face has been variously termed spray electrification and balloelectricity and 



TABLE XII: Processes and Mechanisms Involved in Chargincf by Interface Alteration" 


General process 

Special case or conditions 

Possible charging mechanisms 

Liquid breakup 

High resistivity liquids 
(@o > 10 10 ohm cm) 

Ionic liquids 

(^o < I0 10 ohm cm) 

Statistical distribution due to ion concentration fluctuations 
(most probable charge = 0) 

Double layer disruption 

Unequal ion mobility or migration rates 

Interface contaminants (inch surfactants) 

Liquid-solid contact 

Liquid-liquid contact 

Statistical distribution due to ion concentration fluctuations 

Double layer (zeta potential) disruption 

Volta potential (for electron conducting materials) 

Electrolytic (galvanic) potential (for ionic systems) 

Solid-solid contact 
(inc. solid breakup) 

Metal to metal 

Metal to semiconductor 

Semiconductor to semiconductor 

Volta potential (equalization of Fermi levels) 

Electrolytic potential (where adsorbed water films may be present) 


Metal to insulator 

Semiconductor to 
insulator, or 
Insulator to insu¬ 
lator 

Light contact 
(touching) 

Ion migration (due to inherent or unavoidable ion contamination; 
electron traps 

(a) by random adhesion of ions for contact of dissimilar materials 

(b) by diffusion due to differences in ion concentration or mobilities 

(c) by image attraction 

Anomalous (due to avoidable surface contamination) 



Intense contact 
(compression 
or rubbing) 

Same as for light contact plus the following 

Preferential local heating (to cause ion migration or electron transfer) 
Rupture of chemical bonds 

(Asymmetry of either surface structure or nature of contact or both 
is necessary to achieve charging) 

Solid-gas contact 

Liquid-gas contact 

No known charging provided solid or liquid does not break up and 
gas contains no free ions or suspended particles 


a Covers charging due to relative motion or separation of two contacting phases in the absence of external electrostatic fields, radiation, 
or thermionic emission. The superimposition of such effects may serve to confuse the data in many cases, especially those involving natural 
phenomena. 


LAPPLE 
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involves the formation of charged droplets as the result of bubbling, splashing, 
spraying, and aerodynamic shattering. Some of these operations, such as 
splashing, could involve a liquid-solid interface as well. This type of charging 
has been of historical interest in connection with apparent atmospheric 
electricity around waterfalls and currently in connection with establishing 
the method of generation of thunderstorms. 

Charge levels produced by this mechanism on individual droplets are 
relatively small, normally corresponding to £ ps values of less than 0.01 
V/micron, when there is no concentration of charge due to evaporation of 
liquid in the drop. Dodd (D5), in generating organic liquid drops of from 
1 to 30 microns in diameter with a nebulizer, found that the charge followed 
a Gaussian distribution with zero charge being the most probable (i.e., equal 
quantities of positively and negatively charged drops). The average absolute 
charge was proportional to the § power of particle diameter (or $ ps propor¬ 
tional to the inverse square root of particle diameter). His results for 1-micron 
drops would correspond to a range in <f ps values of 0.0018 V/micron for 
paraffin oil to 0.024 V/micron for dibutylphthalate. For larger droplets the 
i ps values would be lower. Natanson (Nl), using a compressed air nebulizer 
to produce 1- to 4-micron transformer oil drops, obtained similar results. 
The larger drops were almost all charged but only 10-75% of the smaller 
drops were charged. In general he obtained charges of 0-30 electrons per 
particle with an average charge corresponding to a value of £ ps covering the 
range of 0.002-0.006 V/micron. These levels agreed with magnitudes that 
would be calculated on the basis of statistical fluctuations in the local ion 
concentration levels. Harper (HI) reports that for liquids with resistivities 
greater than 10 13 Q • cm, no charging on bubbling was observed. 

With the more conductive liquids, the ion concentration becomes so great 
that ion concentration fluctuations on a statistical basis are likely to be small. 
However, charging can take place by three other mechanisms: (1) mechanical 
disruption of any double layer of ions that may exist at the surface in times 
that are short compared with the relaxation time, with a predominance of 
the surface ions going to the portion of fluid coming from the surface; (2) 
unequal ion mobility with the larger ions unable to return to the bulk of 
liquid as readily as the smaller and more mobile ones; and (3) contaminating 
materials, such as dust or surfactants at the interfaces serving as ion carriers 
into one portion or the other of the ruptured liquid. 

The nature and magnitude of charge resulting from liquid breakup are 
illustrated by the following representative investigations. Blanchard (as 
quoted by Loeb, L9) reports charges corresponding to i ps values of 0.02-0.05 
V/micron when droplets of 4-40 microns are formed as the result of the 
breaking of air bubbles in salt water. When exposed to external electrostatic 
fields of 0.005-0.030 V/micron, the charge levels were increased by an order 
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of magnitude. Because of droplet evaporation, it is possible that the actual 
initial droplet sizes were larger than reported by Blanchard. This would make 
the actual corresponding $ ps values smaller. Loeb (L9) reports that when 
bubbles are allowed to rise through many centimeters of sea water (as in 
Blanchard’s case), the resultant droplets are positive. However, when allowed 
to rise through about 1 cm of water, the drops are negative. Matthews and 
Mason (M3) report negligible charges as the result of aerodynamic breakup 
of 15-mm-diam water drops in the absence of an electrical field. Charges of 
the order of 3 x 10" 9 C/kg (corresponding to an $ ps value of 0.001 Y/micron 
for a 15-mm-diam drop) were found. With external fields ranging from 0.003 
to 0.15 V/micron, charges of 10~ 7 -2 x 10“ 6 C/kg (or <f ps for 15-mm drop 
of 0.03-0.5 V/micron) were obtained. A reversal of the field also reversed the 
sign of the charge on the fine drops formed. 

2. Liquid-Solid Contact 

When liquids come into contact with a solid surface a transverse potential 
difference may be set up at the interface because of a preferential adsorption 
of some ions on the surface with a resultant electrical double layer. This is 
commonly called a zeta potential If a relative motion is set up between the 
liquid and the solid surface, a charge displacement will take place. This will 
result in a buildup of potential difference in the direction of motion until 
the resultant back migration of the unbound ions just balances the rates at 
which they are transported forward by the relative fluid motion. This poten¬ 
tial difference is called a streaming potential Conversely, if an external 
potential gradient is imposed along the interface or across a porous boundary, 
the resultant charge displacement will set up a relative force between the 
liquid and the wall. If the liquid is not otherwise constrained, it will move 
relative to the solid. This is called electrical endosmosis If the liquid is 
constrained and the solid is free to move, the resultant motion of the solid 
body through the liquid is called electrophoresis (or cataphoresis in older 
literature). Endosmosis rates are established by the equilibrium between the 
driving force and any frictional resistance to motion. Endosmosis will cease 
if the hydrostatic head of transported liquid is allowed to build up to a point 
where the driving force is balanced. Similarly electrophoresis rates will be 
established by the equilibrium between the driving force and frictional forces 
on the moving solids. 

When a low conductivity liquid containing some ions (and there are 
almost always some present in all liquids) is forced through a conduit under 
a high pressure gradient, enormous voltages can be generated. This condition 
is treated mathematically by Cooper (C6) who cites a critical resistivity of 10 n 
Q • cm above which voltages high enough to result in sparking to ground 
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can be generated, with resultant fire and explosion hazards, during industrial 
transfer of such liquids from one vessel to another. 

It should be noted that the above phenomena all involve relative motion 
or charge segregation but no static electrification or true charge separation. 
The latter can only occur when the liquid is separated from the solid or is 
broken up. The above phenomena consequently can only create conditions 
that will permit charge separation. Any net charging process itself will, 
therefore, be very similar to that involved in liquid breakup except that the 
initial charge segregation will be influenced by double layers at the solid 
surface as well as by those at a liquid-gas interface. 

The net charge resulting from a simple liquid-solid contact is likely to 
be small, as for liquid breakup. However, if the liquid-solid contact is 
extensive, as in the flow through a long capillary, the potential difference 
buildup could cause extensive charging of any liquid drops formed at the 
end of the pipe. This charging, however, is a result of the electrostatic field 
between the conduit and surroundings. Thus, although this field is a direct 
result of the liquid-solid contact, the droplet charging is only an indirect 
result since it could have been obtained by direct application of an external 
electrostatic field to the discharge end of the conduit. 

With higher ion concentrations or electron-conductive liquids, the effect 
of the zeta potential is essentially superseded by electrolytic (galvanic) or 
volta potentials, respectively. Although these potentials are of the order of 1 V 
while zeta potentials are one or two orders of magnitude lower, the high 
conductivities of the more conductive liquids make it normally impossible to 
build up the high streaming potentials that are possible with low conductivity 
liquids by virtue of the zeta potential. 

Vonnegut et al (V3) impinged deionized water (^2 0 = 10 6 Q • cm) against 
an aluminum plate. The resulting 2-mm-diam drops had a positive charge 
of 4 x 10“ 14 C/drop (corresponding to R c = 1 x 10“ 8 C/kg or £ ps = 0.0004 
V/micron). 

Mercury is a special type of liquid and behaves essentially as a metal for 
which charging mechanisms are discussed in Section V, F, 3. Dodd (D5) 
determined the charge acquired by mercury drops in contact with glass and 
various insulators. The mercury was always positively charged with S ps 
values of the order of 0.01 V/micron. 

Medley (M4) working in vacuum also found that mercury became posi¬ 
tively charged relative to nylon, mica, and Alkathene 63 film. He obtained the 
contact by bringing a column of mercury up against a film of the other 
material, the back of which was covered with a stationary film of mercury 
connected to an electrometer, which measured the charge after the contact 


6a Alkathene is a trademark of I.C.L and is a form of polyethylene. 
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was broken. By varying the thickness of the film, he measured a maximum 
charge of the order of 500 esu/cm 2 (equivalent to a value of $ ps , as defined 
by Eq. (1) for a value of £ ps of 500 esu/cm 2 , of 180 V/micron). 7 When smalt 
amounts of sodium, zinc, lead, or tin were added to the mercury, the sign of 
charge was reversed but the maximum charge level observed was the same. 
This suggested that actual surface charges were initially much higher than 
those measured, the latter being limited by considerations of back discharge 
during separation. 

Liquid-liquid contact will be basically similar to liquid-solid contact. In 
this case, both phases would be mobile, however, and some of the extreme 
charge separations possible in liquid-solid contacts would not be possible. 

3. Solid-Solid Contact 

The terms contact electrification and triboelectrification 8 have been 
applied to the process where charge is acquired when two solids are contacted 
and separated. It is in this area of static electrification that the greatest con¬ 
fusion exists. This confusion comes about because of the complications 
produced by: (1) the variety of mechanisms that can operate (electrolytic films, 
ion transfer, electron transfer); (2) the sensitivity to impurities or surface 
contamination; (3) the manner in which contacting is achieved (normal, 
sliding, rubbing, or rolling contact); (4) the manner of making measurements; 
(5) the presence of external or extraneous electrical fields; (6) variations in 
ambient atmosphere; and (7) back-discharge phenomena during separation. 
In order to understand the basic mechanisms involved, it is informative to 
consider first the current theories of solid state structure. 

a. Theory of Solid State Structure. The modern concept of structure of 
solids visualizes a discrete atom as composed of a nucleus surrounded by 
electrons in distinct energy levels, the number of electrons and their energy 
levels being different for different materials. When atoms are brought into 
proximity to form molecules or crystals, the corresponding electrons in each 
original energy level will be forced to occupy a band of energy levels instead 
of the original single energy level because of the Pauli exclusion principle 
(which states that no two electrons can occupy the same quantum energy 
state at the same time). The outermost or highest energy electrons are the 
valence electrons. Thus, for a proximate assemblage of atoms, this highest 

7 The value of £ ps so calculated would be equivalent to the field existing outside of a 
particle the entire surface of which has the specified charge surface density. Because of the 
stationary mercury film behind the material film, the actual field outside of the material 
film will be considerably less than £ ps . 

8 From the Green word tribein meaning “ to rub.” 
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energy level reverts to a valence band rather than being a discrete level, the 
spread in the bandwidth becoming larger the closer the spacing of atoms. 

In order for an electron to travel from atom to atom (i.e., for electron 
conduction to take place), the electron must first be raised to an unoccupied, 
allowable higher energy level (or band). For assemblages of atoms this is 
called the conduction band. 

With conductors either the valence band is only partially filled (as with 
metals) or it overlaps with the next higher allowable empty band. Hence 
only small amounts of energy are required to raise a valence electron into a 
higher-level empty band where it is free to move from atom to atom. 

With insulators or semiconductors, the valence band is completely filled 
and is separated from the next highest allowable energy band (conduction 
band) by a gap (or forbidden band). In order for valence electrons to be 
raised into the conduction band, they must be given a sizable amount of 
energy. With insulators this gap corresponds to several electron volts. For 
semiconductors the gap is sufficiently small so that a significant number of 
electrons can acquire the necessary additional energy by thermal means at 
room temperature. 

Electrons in thermal equilibrium in the solid state obey Fermi-Dirac 
statistics as given by 

/ p = 1/(1 + e (<PT~<PF)/kT^ (8 4 ) 

where f p is the probability that an electron will occupy an allowed quantum 
energy level <p T at temperature T. The term cp F is known as the Fermi level 
and represents the energy of the highest occupied level at 0°K; (p F is usually 
very large compared to kF. Thus at 0°K,/ p drops abruptly to zero a t(p T = cp F . 
For higher temperatures, the Fermi level corresponds to that energy level cp T 
at which the probability of occupancy is With metals, there actually exists 
an occupied energy level corresponding to the Fermi level. With insulators, 
it is a hypothetical energy level above which electron traps due to imperfec¬ 
tions will probably be empty and below which they will probably be full. 
This results from the fact that the Fermi level lies in the forbidden band for 
insulators. 

b. Mechanisms of Charge Transfer . When two materials are brought to¬ 
gether, the Fermi levels for each must equalize to give a common Fermi 
level for the combined system. Thus electrons will flow from the material 
with the higher initial Fermi level to that with the lower. The material with 
the higher initial Fermi level, therefore, becomes positively charged. For 
metals, the difference in Fermi levels is equal to but opposite in sign to the 
differences in the work functions (the energy required to remove an electron 
from the surface of the metal). Thus, when two metals are contacted, the 
one with the higher work function becomes negative. This type of contact 
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charging, however, is possible only if there is an allowable (i.e., not for¬ 
bidden) unoccupied energy level in the material with the lower Fermi level 
to receive the electrons from the material with the higher Fermi level. This 
will always be the case with metals and some semiconductors. However, 
as pointed out by Vick (V2), the Fermi level of most insulators is below 
that of metals but the conduction band is above that of the Fermi level 
for metals (i.e., the Fermi level of the metal lies in the forbidden band for 
the insulator). Thus no electrons can flow from the metal to the insulator 
since there is no allowable energy band open and no electrons can flow from 
the insulator to the metal because the insulator electron energy levels are too 
low. In unusual cases, the conduction band for the insulator is below the 
Fermi level for the metal so that electron flow from the metal to the insulator 
can occur. With two insulators in contact, electron transfer could only occur 
if the conduction band for one were below the valence electron level (maxi¬ 
mum filled energy level) for the other, an unlikely condition. Strictly speaking 
the above statements are valid only for 0°K where no electrons exist in the 
conduction band. At any finite temperature some electrons will occupy the 
conduction band to an extent dictated by Fermi-Dirac statistics. Thus there 
can always be flow between the conduction bands although the rate may be 
insignificant until temperatures become very high. 

If the temperature of one insulator is raised (as by rubbing), electrons 
may be transferred to the conduction band or the band levels may be altered 
to an extent that would permit appropriate electron flow. The presence of 
surface states may also alter the general picture. Such states, acting as addi¬ 
tional levels within the forbidden band for trapping electrons, may originate 
in various ways, including imperfections of the lattice structure at the surface 
and the presence of other adsorbed atoms. 

Thus the charging resulting from metal-metal contact is readily explained 
in terms of modern concepts of structure and would follow the principles 
described above in Section V, B. For perfect and pure insulators the same 
theory would predict that charging by simple contact may be very slow. 
However, the alteration of surface states, either by the presence of imperfec¬ 
tions or contaminants or by intense contact (pressure or rubbing), could explain 
significant charging by electron transfer. Harper (H3), for example, has shown 
that simple contact between amber and a metal, silica, or polystyrene, will 
yield no significant charge whereas rubbing is known to give a very large 
charge. 

With metal-metal contact, it is also possible to obtain an electrolytic type 
contact potential if adsorbed water films are present. Deaglio (as quoted by 
Harper, H3), in investigating the contact potential between silver and nickel 
balls, observed that when he carefully controlled the gap between the sur¬ 
faces the sign of charge reversed at a gap width of the order of 100 A. This 
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he explained by the formation of an electrolytic cell as the result of water 
adsorption. 

Charging of contacting bodies could occur by transfer of ions between 
the surfaces rather than electrons. This would require, however, that free ions 
be present in the surface. If one in every 100,000 surface molecules were an 
ion, the charge level would correspond to the order of S ps = 1 V/micron. The 
presence of such small concentrations of surface ions is not unreasonable to 
account for the charge levels that have been observed. Harper (H3) prefers 
such ionic transfer as the probable explanation of the charging of insulators 
on light contact. The early experiments by Knoblauch (K5) suggest an ionic 
transfer mechanism. He determined the charge acquired by a plate of platinum, 
glass, or sulfur when various substances were allowed to slide off of it. In 
general, the plates were charged positively by acidic substances and nega¬ 
tively by basic substances. 

From the above, it would be expected that, except for metal-metal con¬ 
tacts, charging can be very sensitive to the method of contacting and to the 
surface condition (or state of contamination) of the material. This is, in 
general, borne out by the seeming contradiction of available data, which are 
based on widely different methods of contacting and degrees of cleanliness. 

Van Ostenburg and Montgomery (VI) have reported calculations of the 
surface charge density that might be expected when contacting specific types 
of materials. Their calculations are based on thermodynamic equilibrium in 
which the Fermi level for the entire system becomes constant and ignore 
consideration of the time that might be involved in achieving equilibrium 
when depending on chance transfer of electrons through the conduction band 
by virtue of thermal energy. In many of their cases the calculated charge levels 
are extremely high (>0.01 C/meter 2 or <f ps > 1000 V/micron). This might 
imply that final charge levels are determined largely by considerations of 
back discharge upon separation of the surfaces. Montgomery (M7) and 
Harper (H3) believe that back discharge is mainly due to electron tunneling 83 
rather than air breakdown. 

The general concept that work functions were somehow involved has led 
a number of investigators to set up a triboelectric series in which any sub¬ 
stance in the series will acquire a positive charge if contacted or rubbed by a 
material lower in the series and a negative charge if rubbed by one higher in 
the series. Shaw and Jex (S8) claim it is impossible to devise a simple series. 
To illustrate this they found that they could obtain a series (most positive 
given first)—filter paper, cotton, glass, zinc, silk, filter paper—that is actually 


8a Electron tunneling is the process whereby electrons are emitted from a body as the 
result of the probability of an electron penetrating a potential barrier according to quantum 
mechanics considerations. 



TABLE Xni 

Typical Triboelectric Series (Most Positive at Top of List) 


WILCKE (1757) . 

BALLOU (19S4)* 

HERSH AND MONTGOMERY (1955)* 

HENRY (1953) 


d by Montgoaery ( 1959>J 

(1SXRH, ^27°C) 

(33SRH. 30°C) 

( 3856RH. 22°C) 

BEACH (1965) 


Polished glass 

Wool flock- 

Writer’s quill 

Wood 

Paper 

Sealing wax 
White wax 
I'npolished glass- 
Lead 

Sulfur - 

Other metals * 


— Wool - 

Nylon - n 

Silk - l 

Viscose rayon -’ 

f Cordura rayon 

1 

(•■Hyman skin 

I 

* Cotton - * 

K - - 

— “ Fiberglas" 

Spun ramie 
Chromium (plate) 
Cellulose acetate-^ 
Dacron yarn 4 

Orion yarn 1 ' " ' J 

Polyethylene ■' > 

Saran - 7 —' 


-r* Wool 

i 

Lfc, Alumijium, magnesium “ 

'-Nylon 

r♦ 

Viscose rayon 

^••Tron. platinum - 

Cotton 

C-Silk 

j -Cellulose acetate — 

Lucite 

Polyvinyl alcohol 
f— Dacron 
/— Orion 
Dvnel 
Velon 

Polyethylene 

Teflon 


“Platinum-l 

Formvar (polyvinyl formal) 1 
Filter paper (Whitman No. 1) 
“Cellulose acetate 
Cellulose triacetate 
Polythene (polyethylene) 

— Aluminum- 

Polysty rene 
Copper (tarnished) 

Rubber (dental dam) 


Flint glass, chamois leathe 
Plate glass 

Borosilicate crown glass 

Zinc, flannel, aluminum (sc 
linen 

Quartz 

Cobalt glass 

“Gelatin film, steel, cadmium, bismuth 
lead, silk, aluminum (oxidized), 
de Khotinsky cement, nickel, Norway 
iron, bees wax, ebonite disk, 
sheet rubber “E” 

Shellac 

Paper towel, ebonite plate 
Copper oxide, sheet rubber''D'’ 

—Soft rubber tube, sulfur 
Sheet rubber “C” 

Amber, sheet rubber “B" 

— Antimony, tin, mercury, copper, silve 
gold, celluloid sheet, iron pyrites 
brass, sheet rubber - ‘A'\ platinum, 
gun cotton, collodion film, zylomt 





GANOT (1917) . 

GRUNER (1953) , 

[as quoted by Sheshoua ( 1 958 )J 

ROSE AND WARD ( 1957) 


SRSBEE (1942) 

l.as quoted by Guest (1933)J 

FUKADA AND FOWLER (1958) 

Asbestos 

Cat skin 

Wool +42 

Ethyl Cellulose 

Mi ca 

Glass 

FIannel 

Perlon II +20 

Casein 

PSWA^ (unplasticized) 

Mica 

Ivory 

Dacron +14 

Perspex 

PAWA (red) 

Woo L 

Rock crystal 

Paper +12 

Tufnol 

PMAA (plasticised) 

Cat's fur 

Gi ass 

Glass, steel +10 

Ebonite 

Amber (natural) 

Lead 

Cotton 

Nylon +7 

Cellulose acetate 

Amber (molded) 

Silk 

Silk 

Cotton +5 

Glass 

Polystyrene (British) 

Aluminum 

Human hand 

Brass 0 

All metals 

Polystyrene (USA) 

Paper 

Wood 

Orion -4 

Polystyrene 

Polyethylene 

Cotton 

Metals 

Hard rubber -14 

Polyethylene 

Polytetrafluorethylene 

Wood, iron 

India rubber 

(bibber -20 

Teflon (polytetrafluorethylene) 


Sealing wax 

Sealing wax 

(numbers above are contact 
potentials against brass) 

Cellulose nitrate 


Ebonite 

Resin 




Nickel. copper i silver, brass 

Sulfur 




Sulfur 

Gutta-percha 




Platinum, mercury 

Gun cotton 




India rubber 






♦ 

5 


Ballou points out that there appears to be a relation between structure of insulators and their position in the aeries. Aside groups (wool, nylon, silk) are at the positive end; hydroxy 1 *rich 
polymers are in the middle; hydrocarbons and haiogenated hydrocarbons (polyethylene, sarin) are et the negative end. At 60% relative humidity the positions in the series were the same except 
that " Fiberglas" moved to a position between wool and nylon. Also a fire-polished glass rod rubbed with silk is charged positively. When the surface was roughened with emery paper, the rod took 
on a negative charge >hen rubbed with either siLk or orlon. 

Arrows indicate that metal show.i is somewhere in range indicated by extremea of dashed line. These data for metals are baaed on scraped surfaces. Ihe data for unscraped (presumably oxidized) 
metals were somewhat lower (mo.-e negative) in the series. 


is polvmethylmethacrvlate. 


FVMA 
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cyclic. Such results could reflect different predominating mechanisms (as, for 
example, contact potentials vs electrolytic effects). Table XIII lists a number 
of such series. Because of the widely different substances covered by each 
investigator, it is difficult to compare the results directly. Some of the sub¬ 
stances common to several of the series have been connected by lines to give 
some indication of the level of agreement. It should be noted that the inclina¬ 
tion of these connecting lines does not indicate disagreement. It is only when 
these lines cross between adjacent series that a discrepancy is implied. 
Although there are a number of discrepancies, the general agreement is good, 
especially considering the variety of ways in which these data were obtained 
and considering the problems discussed above. One of the worst discrepancies 
is that for platinum reported by Henry. It has also been rather generally 
agreed that charge levels are not additive even if a valid triboelectric series 
should exist. It is possible, however, that this conclusion is the result of 
back-discharge processes rather than initial charging processes. 

Based on electroendosmotic experiments, Coehn and Raydt (C4) came 
to the conclusion (later known as Coehn’s rule) that the material of higher 
dielectric constant always assumes a positive charge and that the charge is 
proportional to the difference in dielectric constants. This can be written in 
the form 


^ s = K 5 (S,-d 2 ) (85) 

where d l and d 2 are the dielectric constants of the two materials rubbed 
together and K d is a constant with a value of 1.48 x 10" 5 C/meter 2 , based on 
data reported by Richards (R4). 

If an entire particle is assumed to have a surface charge density equal to 
(i.e., £ ps = l s ), this may be written in terms of i ps as 

**. = **&-«,) W 

where K 5e is a constant equal to 1.67 x 10 6 V/meter (1.67 V/micron) based 
on Richard’s data. 

Actually Coehn’s rule is not generally applicable. In order to obtain the 
seeming correlation above, Richards had to resort to the artifice of assigning 
a value of 3.1 to the dielectric constant of steel. Nevertheless, in the absence 
of any other basis, Coehn’s rule can be used as a guide to the sign of charge 
that may be expected when contacting various insulators. It is also interesting 
to note that positive ion transfer by virtue of image forces resulting from di¬ 
electric constant differences between two contacting surfaces would account 
for Coehn’s rule that high dielectric constant materials become positively 
charged in contact with low dielectric materials. An exception could be 
expected when the substance of high dielectric constant normally contains a 
very high concentration of positive surface ions. 
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c. Typical Data. The literature on solid-solid charging is immense and 
covers all conceivable ways in which solids may be contacted. The following 
are summaries of some representative results in various areas to illustrate 
both the nature and the magnitude of charges attained. 

(1) Simple Contact : Harper (H4) indicates that typical charge levels 
obtained in contacting surfaces range between 3 x 10” 6 and 3 x 10” 5 C/ 
meter 2 (equivalent to S° ps = 0.4-4 V/micron) 9 ; with powders, charge densities 
may be an order of magnitude lower. 

In experiments involving the contacting of quartz spheres having specific 
orientations of their crystal faces, Harper (H2) attained local charges that 
were estimated to be as high as 0.029 C/meter 2 . If this charge density had 
existed over the entire sphere, the corresponding value of $ ps would be 
3300 V/micron. 9 Actually, for this localized charge, the surface field intensity 
would be only some 600 V/micron. 9 He attributes the failure for this charge 
to leak off by air breakdown to the extremely small volume of air exposed to 
the charge and the corresponding low probability of finding a free electron 
in this volume to initiate gas breakdown. 

Harper (H3) gives an analysis which he claims shows that the final charge, 
when contacting two metals, is not significantly affected by the speed with 
which the surfaces are separated, although he presents too few details of his 
analysis to permit a critical evaluation of it. In his analysis he assumed that 
transfer due to electron tunneling could occur during the separation step. 
Bredov and Kshemyanskaya (B13) report that the charge that flows away 
from the contacted surfaces is reduced by a factor of 4 as the speed of breaking 
the contact is increased by a factor of 2. They claim that a theoretical calcu¬ 
lation, assuming the presence of electron tunneling, agrees with their experi¬ 
mental results, but present no details of either this experiment or their 
calculation. It is implied that the contact was between two metals. Their 
data also show essentially a direct relationship between surface charge level 
and the contact potential difference for metal-to-metal and metal-to-semi- 
conductor contacts. 

(2) Compressive Contact : Rose and Ward (R8) made measurements of 
charges resulting from intensively pressing to l-in.-diam dielectric spheres 
against flat metal or dielectric plates. They concluded that the amount of 
charge was directly proportional to the actual contact area, regardless of 
the time or intensity of the contact, the latter simply determining the area of 
contact. This could be construed to indicate that ion rather than electron 

9 The <? ps values are calculated assuming that the entire particle has a surface charge 
density corresponding to that over the contact area (i.e., assuming £ ps — £ s ). Where the 
charge exists only over a limited surface area, the actual field intensity just outside of this 
area will be less than these values of £ ps by a factor [S/(8 + S p )]. This local field intensity 
will drop even lower as the charge has time to be conducted to other regions of the particle. 
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transfer is involved or that the final charge levels are determined primarily by 
considerations of back discharge. The area of contact was separately 
measured by coating the surface with a thin layer of soot and found to agree 
with predictions from Hertz’s (H9) theory of sphere deformation on com¬ 
pression. Maximum areas of contact were 0,03 cm 2 (or less than 1 % of the 
total sphere surface). Although they obtained a consistent triboelectric series 
as shown in Table XIII, they conclude that charge levels in the series were not 
additive. The charge attained when dielectric spheres were pressed against 
metal was independent of the nature of the metal and ranged from 4 x 10" 5 
to 3 x 10" 4 C/meter 2 of contact area (corresponding to S ps values of 4.5-30 
V/micron). 9 When dielectric surfaces were pressed together, charge levels of 
7 x 10" 5 to 2 x 10" 4 C/meter 2 were obtained (£ ps = 7.5 to 20 V/micron). 
By way of comparison, Richard’s (R4) values ranged from 5 x 10" 6 to 
7 x 10" 5 C/meter 2 (corresponding to $ ps values of 0.6-8 V/micron). 9 Rose 
and Ward also found no significant charging when like dielectric surfaces 
were pressed together. Although Rose and Ward claim some correlation of 
charge level with the lowest dielectric constant of the surfaces pressed to¬ 
gether, their plots of data are not convincing on this score. With dielectric- 
metal contacts, they did find the highest charge levels when the dielectric 
resistivity was greater than 10 11 Q * cm. 

(3) Pneumatic Conveying Contact : Kunkel (Kll) pneumatically conveyed 
silica, sulfur, starch, talc, and nickel powders through quartz, Pyrex, sulfur, 
nickel, and platinum tubes. The highest charge level reported was 2700 
electrons for a particle 6.6 microns in diameter blown out of a platinum tube 
(corresponding to S ps = 0.4 V/micron). Whenever contact between unlike 
substances was avoided (e.g., blowing quartz powder out of a quartz tube), 
there were an equal number of positive and negative charges. With unlike 
materials, the charge distribution may become asymmetric. In the most 
asymmetric case the ratio of charges of opposite sign was 4:1. When insu¬ 
lators were impacted on metals, the charge was always asymmetric, with the 
insulator particles having net negative charges. Changing relative humidity 
up to a level of 90% had no appreciable effect. The total charge acquired by 
particles was proportional to between the first and second power of particle 
diameter although there was a very wide scatter in the data. 

Berg and Flood (B8) and Berg, Fernish, and Flood (B5, B6) made 
measurements on the charging of Carbowax 6000 9a and saccharin (with and 
without 1 % Cab-O-Sil additive), Cab-0-Sil, 9a MgO, and NH 4 C1 when passed 
pneumatically through a 0.8-mm-diam stainless steel tube (see Section VI for 

9a Carbowax and Cab-O-Sil are trademarks of commercial materials. Carbowax is a 
series of polyethylene glycols manufactured by Union Carbide. Cab-O-Sil is a superfine 
silica from Cabot Corp. 
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details of equipment), at velocities of 3-15 meters/sec. Cab-O-Sil was pre¬ 
dominantly negatively charged (ratio between negative and positive charge 
as much as 12:1). The charge for the others was about evenly split (within a 
factor of 2) between positive and negative. The charge level for the Carbowax 
and saccharin was in the range of 0.001-0.01 C/kg, while for Cab-O-Sil it 
was between 0.01 and 0.3 C/kg. A 1 % addition of Cab-O-Sil to Carbowax or 
saccharin had only a small effect on charge. The charge level increased as the 
first to second power of gas flow rate but was independent of any initial 
charge on the particles fed into the tube. Mass median diameters were given 
as follows: Carbowax 6000, 14 microns; Cab-O-Sil, 0.02 microns; saccharin, 
7 microns. Based on these diameters, the charge levels would correspond to 
values of 0.3-3 V/micron for Carbowax and saccharin and 0.004-0.1 
V/micron for Cab-O-Sil. The low values of i ps for Cab-O-Sil despite the high 
charge-to-mass ratios are undoubtedly due to failure to disperse the Cab-O- 
Sil to anything approaching its ultimate particle size. With MgO and NH 4 C1, 
charge levels were much higher, 0.1-1 C/kg with one value of 50 C/kg reported. 
Assuming a particle diameter of 1 micron (the nominal discrete particle diam¬ 
eter reported), these charge levels would correspond to S ps values of 2-20 
V/micron with the one extreme charge corresponding to 1000 V/micron. 
Any failure to disperse the particles would mean that the effective value of 
S ps would be even larger. It seems unlikely that the extremely high value 
reported could be real. 

Taneya (Tl) measured the voltage within the cloud formed when skim 
milk was hydraulically atomized. This voltage varied from positive to negative 
as the distance from the nozzle increased. From this, he concluded that the 
small particles are charged positively, and the large ones negatively. It should 
be noted, however, that his probe (which was measuring voltages of the order 
of 0-300 V) need not necessarily reflect only the space charge. In pneumatic 
conveying of dried skim-milk powder, he found that the powder became 
negatively charged and that the duct potential (0-2000 V) increased with both 
concentration of particles in the air stream and with air velocity in the duct. 

(4) Sliding Contact : Taneya (Tl) also allowed five kinds of powder 
(dried skim milk, dried milk, dried buttermilk, flour, and a-lactose hydrate) 
to slide down a grounded V-shaped chute and measured the charge of indivi¬ 
dual particles in a modified type of Millikan cell. The average particle size 
ranged from 30 to 48 microns, and the average absolute charge from 1 to 
4x10" 14 C/particle, with corresponding S ps values of from 0.4 to 0.7 
V/micron. From one-third to one-half of the particles were uncharged, and 
the rest were approximately evenly divided between positive and negative 
particles. The net charge of the powder was positive in all cases and corres¬ 
ponded to charge-to-mass ratios of 3 x 10" 6 to 4 x 10" 5 C/kg. Particles 
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larger than 40 microns did not appear to be charged. This was believed to be 
due to flocculation with smaller particles to achieve charge neutralization. 
He also found that the net charge-to-mass ratio appeared to increase with 
fat content of the material. 

(5) Rolling Contact : A number of investigators have studied charges 
developed during rolling contact. This process is useful for assessing certain 
charging mechanisms. Peterson (P4, P5) rolled 2-mm borosilicate glass spheres 
and 6-mm quartz spheres over nickel at a variety of ambient air pressures and 
rolling speeds. Both types of spheres acquired a negative charge, which was 
at a minimum for an air pressure of about 1 Torr absolute. This was inter¬ 
preted as indicating that at significant air pressures (of the order of 1 Torr and 
over) the maximum charge that the spheres could acquire is limited by dis¬ 
charge due to gas breakdown. Wagner (Wl) conducted a similar series of 
experiments using a wide variety of sphere materials rolled on nickel at very 
high vacuum (10" 5 Torr) in an attempt to measure maximum charge levels 
without being limited by the question of discharge by gas breakdown. When 
rolled on nickel, aluminum oxide became negatively charged while the alkali 
halides became positively charged. Clean magnesium oxide (periclase) was 
positively charged but became negative after heating to 1200°C. This was 
attributed to loss of excess oxygen to produce the stoichiometric compound 
on heating, since it regained its original positive charge after successive polish¬ 
ing. Both Peterson and Wagner found a maximum rate of charging at rolling 
speeds of some 10-15 cm/sec. The increase in rate of charging with rolling 
speed is expected because of the equilibrium that is established between the 
contact charging and the discharging due to surface or bulk conductivity of 
the spheres. The reduced net charging rate at the high speeds was attributed 
either to a change in surface state as a result of the more intensive contacting 
or to a local heating effect with an attendant increased surface conductivity. 
From a detailed study of the maximum charges acquired at high vacuum, 
it was concluded that there was one case (silica) where there was electron 
transfer of a type related to the contact potential of metals. In other cases, 
there were indications of both ion transfer and electron transfer not related 
to work function charging. 

Berg (B4) and Berg and Gaukler (B9) measured the charge attained by 
5-mm-diam borosilicate glass beads when they were rolled down a 24-in-long, 
10-mm-i.d. Pyrex tube at various frequencies and at various humidities. In 
all cases, the bead charge increased with the number of times it was rolled, 
usually reaching a maximum charge after some 400 cycles or rollings. At 
relative humidities below 1.5%, the initial bead charge was random (half of 
the beads positive, half negative), but the beads all eventually acquired a 
negative charge. At humidities of 1.5% and over, almost all the beads were 
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charged negatively at the start, but became positive by the time the maximum 
average charge was reached. At humidities of under 1 % and over 35 %, the 
average maximum absolute charge attained was approximately 2 x 10“ 9 C 
per bead, while at intermediate humidites, it was 1.7 x 10“ 8 C (corresponding 
to values of S ps of 3 V/micron and 25 V/micron respectively). It would appear 
from these results that the beads accumulated charge up to a maximum 
level dictated by the point where charge is lost owing to either air ionization 
at the high charge levels or surface conductivity at the high humidities. 
The charge level of 25 V/micron is difficult to explain, however, since it is 
almost an order of magnitude larger than the field intensity needed for air 
breakdown from a 5-mm bead. 

(6) Miscellaneous Data : Shaffer (S5) reported net charges for BZ, 9b 
Carbopol, 9c and saccharin of 10~ 5 C/kg after working them with a spatula. 
BZ was positive, the others negative, and only about half the charge dis¬ 
appeared after 6 days storage. He also measured the average charge of 
saccharin power (1- to 6-micron particles) after spraying from a DeVilbiss 
atomizer to be of the order of 0.001 C/kg (corresponding to $ ps ^0.05 volt/ 
micron). The sign distribution was 25% positive; 64% negative; 11% 
neutral. The resistivity of saccharin was measured at 10 15 Q, • cm. 

Fukada and Fowler (F5) established a triboelectric series by lightly 
rubbing specimens of insulators together (see Table XIII for their results). 
They made no attempt to obtain quantitative measurements of charge 
because of the different geometries of the bodies used, but did attempt to 
correlate the series with corresponding measurements of the number and 
distribution of electron traps for each of the substances. While it was con¬ 
cluded that those substances with the greatest number of traps and the steep¬ 
est distribution would become negatively charged, the data reported are too 
erratic to be convincing. 

The importance of surface states is demonstrated by the data of Thomas 
(T2) who found that the charge acquired by coal when contacted with iron 
was a function of its degree of oxidation. Raw coal had a positive charge. 
As the coal became oxidized (by exposure to air at 350°C for various lengths 
of time), the coal acquired less of a positive charge, finally becoming strongly 
negative with a highly oxidized coal. This phenomenon was proposed as the 
basis for a method of process control during coal carbonization. Brasefield 
(B12) found similar results for carbon blacks when allowed to slide down a 
grounded nickel surface. Blacks with a low oxygen content acquired a large 
positive charge, while blacks with a high oxygen content became negative. 

9b BZ is a chemical incapacitating agent. Its composition is security classified. 

9c Carbopol is a thickening agent from B. F. Goodrich Chemical Company. 
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The charge was zero at a 4% oxygen content. Similar phenomena in the 
inorganic field have previously been discussed in connection with Wagner’s 
(Wl) results on magnesium oxide. 

4. General Comments 

From the above discussion it is apparant that, in the absence of external 
electrostatic fields, charging of liquids by means of contacting or interface 
alteration mechanisms are capable of yielding only low charge levels, corres¬ 
ponding to S ps values of less than 0.1 V/micron, probably less than 0.01 
V/micron. With solids however, relatively high charge levels (S ps =1 — 10 V/ 
micron) can be attained. Although some values have been reported that are 
radically higher than this, there is reason to question their validity. 

Although true homopolar charging is basically possible in contact charging 
of solids, and has been obtained with single bodies, it has never been achieved 
with powders, which have always yielded a significant amount of ambipolar 
charges. This is because of the contact that invariably occurs between like 
particles, either in suspension or as the result of some particles adhering to 
and coating pipe walls. From a practical standpoint it seems doubtful that 
homopolar charging of powders can be realized by these mechanisms. The 
closest approach to homopolar charging is probably an extreme asymmetric 
charging. 

It should also be recognized that conducting particles are charged on the 
first contact; repeated contacts will not yield added charging (assuming that 
the particle shape is not altered in the process). With insulators however, 
successive contacts will permit additional charging until the maximum possible 
stable charge exists over the entire particle surface. With conducting particles 
it is possible that flattened shapes will permit high charge levels to be attained. 
Distortion during contacting will not be useful in this respect unless the elastic 
limit is exceeded so that the distortion is permanent. 

It seems fortuitous that the same order of maximum charge levels has been 
reported for metal-metal contacts as for insulator-insulator contacts. This 
might indicate that back discharge could be a controlling factor in establishing 
final charge levels. 

In a case where charge is acquired by rolling, the surface conductivity 
of the particle may be important in establishing the final level of charge 
acquired. Thus an insulator of high conductivity (say 0t^< 10 10 Q • cm) may 
not only be incapable of acquiring the high local charge possible with a 
metal-metal contact, but, because of its high conductivity, may not benefit 
from successive contacts along other parts of its surface. On the other hand, 
the fact that its conductivity is high would imply a high concentration of 
surface ions. Thus, if charging is due to ion transfer, a higher local charging 
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may be possible. Current data are not yet adequate to resolve questions of 
this type. 


G. Charging by Phase Change 
1, Solidification or Freezing 

A number of investigators have reported a net electrification of ice 
crystals formed as the result of the freezing of water droplets. Mason and 
Maybank (Ml), Latham and Mason (L6), Latham (L4), and Evans and 
Hutchinson (E4), have reported charge levels corresponding to values in 
the range of 0.004-0.02 V/micron. Kachurin and Bekryaev (Kl) report values 
of the order of 0.1 V/micron. In all cases, the fine ice crystals are positively 
charged. The most likely explanation of this phenomenon is that proposed by 
Latham and Mason (L5). They suggest that charging is not a direct result 
of the freezing process but that charge displacement occurs because of the 
combined effect of different ion mobilities and temperature gradients. Jf a 
temperature gradient exists within a drop, there will also be a gradient in 
degree of water ionization, the concentration of both H + and OH" ions 
being greatest in the region of highest temperature. Because of the much 
higher mobility of the H + ion, it will diffuse more rapidly than the OH - 
ions toward the colder regions of lower ionic concentration. This displacement 
will set up a potential difference, which will then induce a backflow of H + 
ions. An equilibrium potential difference or charge segregation will be reached 
when the back-flow rate equals the forward diffusion rate. A net charge 
separation, however, cannot be realized until the drop is shattered. While 
this phenomenon should occur with either a liquid or solid across which 
a temperature gradient exists, the difference in mobilities between OH - and 
H + ion is greater in ice than in water. 10 

Since the outside of the drop will be the coldest and the first to freeze, 
it will tend to contain an excess of H + ions with the compensating OH” 
being concentrated at the center of the drop. When the drop shatters as the 
result of stressess set up by freezing, it will send out fine crystals from the 
surface and these will be positively charged. Latham and Mason (L5) have 

10 Based on data reported by Eigen and De Maeyer (E2), the mobilities at (FC in units 
of (cm/sec)/(V/cm) are 

Type ion In water In ice 
H + 0.0024 0.1-0.5 

OH' 0.0011 <0.05 
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predicted that relatively high charge levels can be attained with sufficiently 
high temperature gradients. 

2. Condensation 

Very few data are available on possible charges arising as the result of 
condensation. Dubois (D8) made measurements on the evaporation of and 
condensation on 20- to 60-micron droplets consisting of water solutions of 
various salts. He reports the appearance of a positive charge on the droplet as 
the result of condensation and negative charge as the result of evaporation. 
The charge level corresponded to $ ps values of the order of 0.01 V/micron or 
less. Dubois attempts to explain this on the basis of the formation of ions by 
rupturing of molecular bonds during evaporation, with the electrons remain¬ 
ing in the liquid phase. He gives no reason, however, why the positive ions 
should go preferentially into the vapor phase. 

H. Other Methods of Charging 

There are two other methods by which particles can become charged. These 
both involve emission of electrons or ions: photoemission and field emission. 
Photoemission results from the bombardment of the particle surface by 
electromagnetic radiation. Field emission is the result of subjecting the particle 
surface to a high electric stress (field intensity). 

Visible light or other electromagnetic radiation incident on a solid, liquid, 
or gas can liberate electric charges. This is called photoelectricity. Ejection of 
electrons from the surface is usually called photoemission. Electrons or 
positive ions formed in a gas as the result of such radiation is called photo¬ 
ionization. Such a process, however, cannot charge a particle directly. The 
charging process in that case is a direct result of subsequent diffusion. 

When a material is subjected to an electric field intensity greater than 10 7 
V/cm (1000 V/micron), electrons may be emitted. At intensities greater than 
10 8 V/cm (10 4 V/micron), ions may be emitted as the result of electron 
bombardment while at 5 x 10 8 V/cm direct field desorption may occur. 

Photoemission is unlikely to be a significant contributor to the charging 
of particles. Field emission, however, could be involved in some of the inten¬ 
sive electrostatic atomization processes where very fine highly charged particles 
are produced. Because of the dubious current applicability, no effort has been 
made to review the literature in these cases. 

Several other mechanisms could be construed as potential means for 
charging particles, but they are all secondary factors that depend on some 
other mechanisms for achieving a charge on the particle. Nuclear fragments, 
for example, may produce gaseous electrons, ions, or photons. It is then these 
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that actually result in particle charging by means of diffusion or emission. 
Photoionization, previously mentioned, is in this category. The use of radio¬ 
active substances for alleviating problems with static electricity in industry 
are also in this category. The radioactivity ionizes the surrounding gas, 
which then acts to neutralize any free charges by diffusion and electrostatic 
attraction. Shock waves can produce some form of radiation or temperature 
rise, which then permits charging by one of the previously described mechan¬ 
isms. Chemical reactions may generate ions (or free radicals), which are then 
available for diffusion charging of particles or direct charging if the reaction 
is allowed to take place on the particle surface. 


VI. Charge Measurement Techniques 

Particle charge can be determined directly by measuring either (1) the 
current flowing from (or to), or the change in potential of, a surface from which 
the particles are disseminated or upon which they are deposited, or (2) the 
space charge due to the presence of the particles. Particle charge can also be 
determined indirectly by measuring the behavior of the particles in an electric 
or electromagnetic field. The direct measurements will normally measure 
particle charge by itself, whereas the indirect measurements will measure some 
combination of particle charge with some other particle property (such as 
diameter or mass). Thus, to obtain charge alone by the indirect methods 
requires another independent measurement of the other particle property 
included. Some of the indirect measurement methods are designed to do this 
in the same apparatus; others are not. 

The charge on particles may be measured individually or collectively. 
The individual measurements will give a complete charge distribution if a large 
number of particles are measured. Collective measurement may measure the 
net charge (i.e., algebraic sum of the individual charges), the total charge 
(i.e., sum of the individual charges regardless of sign), or the charge distribu¬ 
tion (i.e., relative quantity of positive, negative, and neutral). Such measure¬ 
ments may be made to yield an average value of charge for the entire aerosol 
or to yield a complete charge distribution. 

Various types of measurements are outlined in Table XIV. Table XV 
gives a summary of many of the specific techniques that have been reported 
in the literature, indicating the general nature of each technique and what it 
measures. The following will describe each technique in more detail, approxi¬ 
mately in the order that it is mentioned in Table XV. 

Hendricks (H5) and Cho (C2, Method No. 1) both measured particle 
charge by measuring the voltage pulse on an oscilloscope due to the passage 
of a charged particle through a drift tube detector. The particle mass was 
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TABLE XIV 

Types of Measurements for Determining Particle Charge 


I. Direct 

A. Measurement of Current 

1. Current flowing from the surface (or electrode) due to continuously depositing 
particles 

a. permanently deposited 

b. momentarily deposited 

2. Current flowing from a surface (or electrode) from which particles are con¬ 
tinuously disseminated 

B. Measurement of Voltage 

1. Due to cumulative charge 

a. from deposition of particles on a surface (or electrode) 

b. from dissemination of particles from a surface (or electrode) 

2. Due to instantaneous charge 

a. from an enclosed space charge 

b. from momentary deposition of particles 
II. Indirect 

A. Measurement of Particle Path 

1. In quiescent fluid 

a. in dc field 

b. in ac field 

2, In moving fluid 

B. Measurement of Balancing Forces on Particles 

1. Static (Millikan cell) 

2. Dynamic (quadrupole mass spectrometer) 

C. Measurement of Particle Deposition 


determined by measuring the velocity attained by the particle when it was 
accelerated through a known distance by a known potential gradient. A high 
vacuum was used in this apparatus. 

Hopper and Laby (HI 4) used a modified version of the Millikan cell. 
A charged particle falling between two parallel vertical plates is photographed 
both in the absence and in the presence of an electric field. In the absence of 
the field a vertical track is obtained (provided no convective air currents 
are present). By using a timed, intermittent illumination, the particle settling 
velocity can be calculated from this track. In the presence of the field, the 
particle will have a path that is inclined to the vertical. The angle of the path 
with the vertical is a direct measure of the charge-to-mass ratio. The direction 
of the inclination gives the sign of the charge. Particle mobility is obtained 
from the combination of the charge-to-mass ratio and settling velocity. 
It could, however, have been obtained directly by measuring the horizontal 
drift rate in the presence of the field. Kunkel and Hansen (K12) and Dodd (D5) 
used the same basic technique. Kunkel and Hansen improved the optical 
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system to permit simultaneous determinations for a large number of particles. 
They also state that the entire apparatus must be maintained isothermal 
within ±0.01°C to avoid excessive convection currents. 

Wells and Gerke (W2) used a method similar to that of Hopper and Laby 
(HI4) except that the field was reversed at a fixed low frequency (~1 cps), 
giving the equivalent of a low-frequency square-wave alternating current. 
This caused the particle to pursue a zigzag path, the amplitude of which is 
a direct measure of the particle mobility. Any convection currents can be 
detected by a drift in the zigzag path. Natanson (Nl) and Solov’yev (S10) 
used this same basic method. This method does not permit the sign of charge 
to be established unless a prolonged period of dc operation is superimposed. 

Taneya (Tl) also photographed the fall of particles between two vertical 
parallel electrodes using a 50-cycle alternating current. The mobility of the 
particles in the electrostatic field is obtained from the horizontal amplitude 
of the particle path and the current frequency. The gravitational settling ve¬ 
locity is obtained from the wavelength of the vertical path traversed and the 
frequency. Sign of charge was determined by a momentary application of a 
dc potential. This method minimizes sensitivity to convection currents. 

Goyer, Gruen, and LaMer (G7), Shaffer (S5), and Doyle, Moffett, and 
Vonnegut (D6), measured particle charge-to-mass ratio by balancing the 
particle, with an electric field counteracting the effect of gravity. They all 
used either the classical Millikan cell procedure, or some modification thereof. 
A separate measurement in the absence of the electric field is needed to obtain 
particle size. In the Millikan cell, this is obtained by measuring the gravity 
settling velocity of the identical particle in the absence of the electric field. 
Thus, in effect the Millikan cell measures charge-to-mass ratio and mobility. 
Doyle et al , (D6) measured drop size by a calibrated filter-paper absorption 
technique of other particles assumed to be of the same size. 

Vonnegut et al. (V3), in addition to reporting the use of the same method 
described by Doyle et al (D6), suggested a modification in the same category 
that would permit measurement of both charge and mass. Because of the 
ionic image effect, the electric field required to balance a charged particle 
against the action of gravity will become less as the particle approaches the 
top electrode. They show that, by measuring the necessary balancing force 
at two clearances between the particle and the electrode, both particle charge 
and particle mass are established, provided the clearance is also measured in 
each case. There is, however, no report of any actual application of this 
method. 

Shapiro and Watson (S6) and Frickel (F2) considered a device which in 
principle is similar to the quadrupole mass spectrometer to be discussed 
later. In this device individual particles are suspended and balanced in an 
alternating field in such a way that they cannot escape. The frequency of 



TABLE XV 

Summary of Charge Measurement Techniques Reported in Literature 


Type of Analysis 


Type of 
Charge 
Measure¬ 
ment + 

Scope of Charge 
Analysis 

Quantity Actually Measured** 

Pressure 

(atm) 

Author(s) 

Year 

Ref. 

Gives 

Single 

Average 

Values 

Only 

Gives 

Distri¬ 

bution 

Data 

Charge 

<v 

Charge 

to-Mass 

Ratio 

ty-p) 

Charge 

and 

Mass 

(VV 

Electro¬ 

static 

Mobility*** 

(2 /d ) 

P P 

I B 2 a 


* 



X 


~ 10” 8 

Hendricks 

1959 

H5 

I B 2 a 


* 



X 


~ 10” 8 

Cho (Method l) 

1964 

C2 

II A 1 a 


* 


X 


X 

1 

Hopper,Laby 

1941 

H14 

II A 1 a 


* 


X 


X 

1 

Kunkel,Hansen 

1950 

K12 

II A 1 a 


* 


X 


X 

1 

Dodd 

1953 

D5 

II A 1 b 


* 


X 


X 

1 

Wei1s,Gerke 

1919 

W2 

II A 1 b 


* 


X 


X 

1 

Natanson 

1949 

N1 

II A 1 b 


* 


X 


X 

1 

Solov*yev 

1957 

S10 

II A 1 b 


* 


X 


X 

1 

Taneya 

1963 

T1 

II B 1 


* 


X 


X 

1 

Goyer.Gruen.LaMer 

1954 

G7 

II B 1 


* 


X 


X 

1 

Shaffer (Method l) 

1962 

S5 

II B 1 


* 


X 



1 

Doyle.Moffett,Vonnegut 

1964 

D6 

II B 2 


* 


X 



~ 10” 8 

Shapiro,Watson 

1963 

S6 

U B 2 


* 


X 



1 

Frickel 

1964 

F2 

I Ala 

X 




X 


1 

Penney.Lynch(Method l) 

1957 

P2 

I Ala 

X 




X 


1 

Hewitt (Method 1) 

1957 

H10 

I Alb 

X 


X 




1 

Soo et al. 

1964 

Sll 

I A 2 

X 




X 


1 

Graf 

1962 

G8 

I B 1 a ' 

X 




X 


1 

Shaffer (Method 2) 

1962 

S5 

I B 1 a 

X 




X 


1 

Masters 

1953 

M2 

I Bib 

X 


X 




1 

Whitman j 

1926 

W5 

II B 2 


X 


X 



~ 10” 8 

Schultz , Branson 

1959 

S2 

II B 2 


X 


X 



-V. 10' 8 

Schultz, Wiech 

1960 

S3 

II B 2 


X 


X 



~ 10” 8 

Hogan 

1963 

H12.H13 

II B 2 


X 


X 



- 10” 8 

Cho (Method 2) 

1964 

C2 

II B 2 


X 


X 



- 10” 8 

Cohen 

1964 

C5 

II C 


X 




x ; 

1 

Penney.Lynch(Method 2) 

1957 

P2 

I Ala 

X 




X 

^ ”! 

1 

Berg.Fernish.Flood 

1964 

B6 

I Ala 


X 




X 

1 

Daniel.Brackett 

1951 

D1 

II A 2 


X 




X ! 

1 

Gillespie.Langstroth 

1952 

G4 

II A 2 


X 




X 

1 

Langer , Radnik 

1961 

L3 

II A 2 


X 




X 

1 

Sergiyeva 

1958 

S4 

II A 2 

X 





X 

1 

Hewitt (Method 2) 

1957 

H10 

II A 2 

X 





X 

1 

Kraemer,Ranz 

1952 

K8 

II A 2 

X 





X 

1 

Suzuki.Tomura 

1962 

S13 

II A 2 

X 





X 

1 

Hinkle , Orr , DallaValle 

1954 

Hll 


Particles are 

measured 

individually 


Particles are 

measured 

collectively 


(a) Net charge 
only 


(b) Positive, 
negative, 
and neutral 
fractions 
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oscillation of these particles is a measure of their charge-to-mass ratio. 
Shapiro and Watson gave only a theoretical analysis. 

Penney and Lynch (P2, Method No. 1) and Hewitt (H10, Method No. 1) 
collected aerosol in a fibrous filter. They measured the current flow to ground 
from this filter during the deposition and the total mass of aerosol collected 
during the run. 

Soo et al (SI 1) allowed the particles, moving in a duct at high velocity, to 
impinge on a |-in.-diam stainless steel ball. The discharge current to ground 
from the J-in ball was measured. In interpreting the results it was assumed 
(without being proven) that each particle is discharged completely before 
being reentrained from the ball and that no recharging of the particles occurs 
because of contact potential effects. Several other assumptions were also 
made but were less basic in nature and are subject to calibration or correction. 

Graf (G8) measured the total particle charge by observing the current 
flow to or from the metallic needle from which liquid was being atomized 
electrostatically, assuming that all this charge is transmitted to the liquid 
drops. The mass of atomized liquid was determined by weighing the amount 
subsequently collected on a grounded plate. 

Shaffer (S5) collected the particles in a Faraday cage and measured the 
change in potential and the change in mass of the cage to establish particle 
charge and mass. Masters (M2) used a similar procedure using a filter paper 
inside of a duct, letting the duct act as a Faraday cage. Both methods are 
similar to those used by Penney and Lynch and by Hewitt except that a total 
voltage change was measured rather than an integrated current flow in order 
to obtain total charge. 

In Whitman’s (W5) experiments, particles became charged by passing 
the aerosol through a metallic tube. The charge acquired was measured by 
noting the rise in potential of the tube by an electroscope and assuming the 
corresponding charge was all transferred to the particle. This is in effect 
similar to Graf’s (G8) technique except that Graf measured current rather than 
voltage. 

Schultz and Branson (S2), Schultz and Wiech (S3), Hogan (H12, H13), 
Cho (C2, Method No. 2), and Cohen (C5) all used the quadrupole mass 
spectrometer. This involves passage of accelerated aerosol particles between 
four longitudinal electrodes in high vacuum. These electrodes have imposed 


Footnotes to Table XV: 

* Individual particles are measured; hence distribution data are obtained by measuring 
a number of particles. 

** Does not include measurements which authors may have made by a separate 
apparatus or technique. 

*** Actually measures k c ^ P l[jiD p , thus including some fluid properties as well, 
t See Table XIV for details. 
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on them an oscillating electric field adjusted to allow only particles of a given 
mass-to-charge ratio to penetrate longitudinally. A high vacuum (~10“ 8 
atm) is essential to the operation of this device since any dampening factor 
(such as fluid friction) interferes with the discriminating ability of the instru¬ 
ment. 

Penney and Lynch (P2, Method No. 2) passed an aerosol between two 
parallel plates with an adjustable potential across them. The collection effici¬ 
ency achieved is a measure of the average electrostatic mobility of the particles. 

Berg, Fernish and Flood (B6) passed an aerosol between two shielded 
parallel electrodes with a potential of +1000 and —1000 V, respectively. 
The clearance between the electrodes was 4 mm. The positively charged 
particles are caused to migrate toward the negative electrode, the negatively 
charged ones toward the positive electrode, and the neutral particles are 
collected in a filter. The magnitude of the charge is determined by measuring 
and integrating the discharge current from each electrode. The total mass is 
determined by weighing the deposit on each electrode and on the filter. Air 
flow rates were 50-500 cm 3 /min and the total mass collected was 0.1-2 mg 
during a run. The air was introduced through a 0.8-mm-i.d. tube at velocities 
of 1.5-15 m/sec. Since runs were of the order of 1 min, aerosol concentrations 
were of the order of 1,000-10,000 ^g/liter (corresponding to 10 6 -10 7 particles/ 
cm 3 for 1 -micron-diam particles). No significant quantity of uncharged particles 
was ever found. 

The apparatus of Daniel and Brackett (Dl) was in principle similar to that 
of Berg, Fernish, and Flood (B6) except that they passed the aerosol through 
two sets of parallel plates in series, ^ in. apart in each set. Potentials across 
the plates were varied over the range ± 800 V. The second series of parallel 
plates is essentially a basis for checking results from the first set. The current 
from each set of plates was measured as a function of the applied voltage. By 
differentiating the curves obtained in this fashion, it is possible to obtain a 
distribution in terms of electrostatic particle mobility. This technique is 
essentially the electrostatic counterpart of the continuous gravity settling 
chamber or of the Oden sedimentation balance. 

Gillespie and Langstroth (G4) allowed a flat aerosol stream, surrounded 
by a shielding gas stream of like velocity, to flow between two parallel 
electrodes. The particles were collected on the electrodes and examined 
microscopically. The position on the electrode is a measure of mobility. 
The additional independent measurement of size gives a means for calculating 
the corresponding charge. A distribution is obtained by counting particles 
along the entire electrode length. Langer and Radnik (L3) used a similar 
device. 

Sergiyeva (S4) employed a modification of Gillespie and Langstroth’s 
method in which the aerosol is allowed to flow across the entire width of the 
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apparatus. However, the aerosol first flows between a series of charged 
parallel plates where aerosol particles are precipitated from all but a thin 
stream. He prefers this procedure to the use of a shielding stream to minimize 
evaporation problems with liquid drops. 

Hewitt’s (H10, Method No. 2) method is similar to that of Gillespie 
and Langstroth except that, instead of collecting the particles, the actual 
concentration of aerosol at various points along the electrode was measured 
by passing it through a slit that could be moved. 

Kraemer and Ranz (K8) also used equipment similar to that of Gillespie 
and Langstroth. However, since they were dealing with essentially homopolar 
aerosols, the aerosol jet was simply deflected from the axis by a fixed angle. 
They photographed the jet deflection and determined mobility from the angle 
so observed. Suzuki and Tomura (SI3) give a theoretical analysis of the 
Kraemer and Ranz spectrometer. 

Hinkle, Orr, and DallaValle (Hll) passed a flat aerosol jet between two 
cylindrical electrodes, with the electrodes parallel to each other and to the 
plane of the aerosol jet. The jet was caused to spread toward the electrodes and 
was photographed. From the lateral displacement and optical intensity, they 
were able to obtain a distribution between positive, negative, and neutral 
particles and an approximate value of average charge. 

It is interesting to note from Table XV that all methods that have been 
used to obtain charge distribution data by means of measurements on collec¬ 
tions of particles at atmospheric pressure involve a measure of electrostatic 
particle mobility. Other aspects of charging have only been obtained with 
those methods in which individual particles are evaluated. 

The particle deflection due to an electrostatic field alone is a direct measure 
of electrostatic particle mobility. This assumes that periods of particle acceler¬ 
ation are negligible, which is usually the case for particles of interest in 
aerosol dissemination. If this deflection is compared with that due to the 
gravity field, a direct measure of charge-to-mass ratio is obtained. Such a 
comparison is possible in all those techniques involving particle motions in 
a quiescent fluid. With a flowing fluid, however, the fluid velocity is usually 
so large compared with the gravitational settling velocity that the mobility 
due to the electrostatic field is the only item that can be measured directly. 

It should also be noted that in those methods in which charged particles 
are deposited on an electrode under the influence of an electrostatic field, 
there exists the possibility of reverse contact charging of the particle if the 
particle is sufficiently conductive. Such contact charging can cause the particle 
to oscillate between the electrodes, carrying current from one to the other. 
This phenomenon has often been observed in connection with electrostatic 
printing and has been specifically reported by Cho (C2). Such a transfer of 
particles would result in a transfer of charge to the electrodes far in excess 
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of that originally carried by the deposited particles and could introduce a 
serious error in all methods that measure charge transfer to charged, electrodes 
(Methods IA-la or IA-lb in Table XIV). Daniel and Brackett (Dl), Penney 
and Lynch (P2, Method No. 1), and Hewitt (H10, Method No. 1) used 
relatively low potential gradients so that this effect may not have been serious. 
Berg, Fernish, and Flood (B6) used very high potential gradients and this 
effect may have been large. From the published data, it is not possible to reach 
any conclusions on this score. It is noteworthy, however, that they report 
much higher charge-to-mass ratios than other investigators working with 
similar systems, a result which could be explained by such extraneous oscilla¬ 
tory transfer. Ruling against this explanation, on the other hand, is the illust¬ 
rative current-time curve that they present. This curve would indicate that the 
current flow ceases suddenly once the aerosol flow ceases. If oscillatory trans¬ 
fer were involved, it would be expected that the current would drop gradually 
after the aerosol flow ceases. 

An electronic probe has been evaluated by Guyton (G10) and by Geist, 
York, and Brown (G2) for measuring particle size. In both devices an aerosol 
stream impinged on a pickup wire, which was connected to electronic equip¬ 
ment wherein the pulses were amplified, classified by magnitude, and counted. 
Guyton used primarily a grounded wire and found that all materials gave a 
positive pulse proportional to the square of particle diameter. Nonconductors 
(such as talc, dry potassium chloride, and barium carbonate) gave the highest 
pulse of almost the same magnitude. Conducting particles, such as iron, gave 
very weak pulses. By charging the pickup to 400 V, either negatively or 
positively, neither the sign nor magnitude of the pulse was altered when 
dealing with solid aerosol particles. With water, however, which normally 
also gave a positive pulse, a positive wire potential tended to neutralize the 
pulse, completely neutralizing it at +22 V, and gave an increasingly large 
negative pulse at higher positive potentials on the wire. For negative potentials 
the pulse became increasingly positive. Geist applied a +430 V potential 
on his pickup. He made measurements by allowing previously grounded 
metal spheres to strike the pickup wire (a 4-in piece of 18-gauge copper wire) 
and by allowing drops to fall on the wire. Only negative pulses were observed. 
No measurable pulses were obtained with nonconducting liquids such as 
carbon tetrachloride, benzene, toluene, or kerosene or by striking the pickup 
gently with a polystyrene rod. Water, acetone, and alcohol did give a pulse, 
the latter two giving the smaller pulse. Small pulses (0.1 V) were obtained 
when the wire was struck a sharp blow with the polystyrene rod. Geist 
found the magnitude of the pulse with the conductors to be proportional to 
the 3.6 power of sphere diameter. 

Guyton concludes that with nonconductors the charging process is due to 
particle electrification either upon impact or during passage through the 
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nozzle prior to impact. Geist explains his charges with conducting particles 
on the basis of the capacity increase when the spheres or drops come in contact 
with the wire pickup. Guyton arrives at this same explanation for his results 
with water. It should also be noted that the apparent sensitivity of Geist’s 
measurements was 10,000 /iV at the pickup wire while Guyton’s sensitivity 
at that point was much better, 60 fiY. The impaction velocities in Guyton’s 
tests were also quite high, approximately sonic velocity (350 m/sec). The lower 
size limit of applicability of Guyton’s instrument was 2-3 microns. 

Keily (K2) also reports on the development of a similar system for measur¬ 
ing ground fogs in the size range of 5 microns and larger. He reports a pulse 
signal proportional to particle surface. However, the results indicated either 
no dependency or a less-than linear dependency of signal on the magnitude 
of any potential applied to the pickup, for positive potentials up to 450 V. 

While these devices were intended as a means for size analysis, the principle 
should be applicable to charge analysis. There remains, however, the problem 
of distinguishing between original charge on a particle and charge generated 
upon interception by the probe and reconciling some of the seeming differences 
between the results of Guyton and Geist. [Appendix begins on p. 84.] 
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Conversion Factors for Commonly Used Units 


oo 



Quantity 

To convert to the MKSA 

From the units 



Symbol 

Description 

units listed below 

listed below fl 

Multiply by 

Or divide by 

A 

area, surface 

meter 2 

cm 2 

10~ 4 

10 4 




ft 2 

0.09200 

10.764 

c 

concentration 

kg/meter 3 

ftg/liter 

10- 6 

10« 




mg/meter 3 

10-« 

10« 




mg/ft 3 

3.531 x 10- 5 

28,320 




grains/ft 3 

0.002288 

437.0 

V 

capacitance 

farads or 

statfarads 

(10 5 /c 2 ) = 1.11265 X 10' 12 

8.98755 x 10“ 



C/V or 
(C) 2 /J 




D, L 9 R 

diameter 

meter 

microns 

10- 6 

10 6 


distance, 


cm 

10- 2 

10 2 


length. 


in. 

0.025400 

39.370 


radius 


ft. 

0.3048 

3.2808 




mile 

1609.35 

6.2137 x lO" 4 


diffusivity 

meter 2 /sec 

cm 2 /sec 

10“ 4 

10 4 




ft 2 /sec 

0.09290 

10.764 

E 

potential 

Vor 

statvolts 

(10" 6 c) = 299.79 

0.00333564 



J/C 

ergs/electron 

6.24181 x 10 11 

1.60210 x 10" 12 
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<? 

potential 

V/meter 

V/cm 

10 2 

io - 2 


gradient. 


statvolts/cm 

29,979 

3.33564 x 10" 5 


field 


V/micron 

10 6 

io - 6 


intensity 





F 

force 

N or 

dyn 

10~ S 

10 s 



(kg)(meter)/(sec) 2 

lb force 

4.4482 

0.22481 

G 

mass 

(kg)/(sec)(meter 2 ) 

(lb)/(sec)(ft 2 ) 

4.88240 

0.20482 


velocity 


(gm)/(sec)(cm 2 ) 

10 

io - 1 

m 

mass 

kg 

grain 

6.480 x 10- 5 

15,432 




gm 

IO" 3 

10 3 




lb 

0.45359 

2.2046 

P 

pressure 

N/meter 2 or 

dyn/cm 2 

10“ l 

10 



(kg)/(sec) 2 (meter) 

lb force/ft 2 

47.882 

0.020885 




mm Hg(0°C) 

133.322 

0.0075006 




in. water (15°C) 

248.87 

0.004018 




lb force/in. 2 

6895. 

1.4503 X 10~ 4 




atm 

1.01325 x 10 5 

9.869 x 10“ 6 

9 

energy, work 

J, 

eV 

1.60210 X 10" 19 

6.24181 x 10 18 



(C)(V), or 

erg 

10" 7 

10 7 



(kg)(meter) 2 /(sec) 2 

ft-lb force 

1.3558 

0.73755 




cal 

4.1868 

0.23885 




Btu 

1055.04 

9.478 x 10~ 4 


electrical 

C 

electrons 

1.60210 X IO" 19 

6.24181 x 10 18 


charge 


statcoulomb 

(0.1/c) = 3.33564 x IO" 10 

2.9979 x 10 9 


surface 

C/meter 2 

electron/cm 2 

1.60210 x 10- 15 

6.24181 x 10 14 


charge 


electron/micron 2 

1.60210 X 10“ 7 

6.24181 x 10 6 


concentration 


statcoulombs/cm 2 

3.3356 x 10- 6 

2.9979 x 10 5 




coulombs/cm 2 

10 4 

IO" 4 


Continued p° 
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Conversion Factors for Commonly Used Units (< Continued) 


Symbol 

Quantity 

Description 

To convert to the MKSA 
units listed below 

From the units 
listed below* 

Multiply by 

Or divide by 

R c 

charge-to-mass 

c/kg 

statcoulomb/gm 

3.3356 x 10- 7 

2.9979 x 10 6 


ratio 


electrons/fijug 

1.60210 x 10~ 4 

6,241.81 




ftC/g m 

10" 3 

10 3 

u 

velocity 

meter/sec 

cm/sec 

10“ 2 

10 2 




ft/sec 

0.30480 

3.2808 




mile/hr 

0.4470 

2.237 

V 

volume 

meter 3 

cm 3 

10" 6 

10 6 




U.S. gal 

0.003785 

264.2 




cu ft 

0.028316 

35.316 

P 

viscosity 

(kg)/(meter)(sec) 

centipoise 

10~ 3 

10 3 




poise or (gm)/(cm)(sec) 

lO ” 1 

10 




(lb)/(ft)(sec) 

1.4882 

0.6720 

P 

density 

kg/meter 3 

lb/ft 3 

16.019 

0.06243 




gm/cm 3 

10 3 

io - 3 

G 

surface 

N/meter or 

dyn/cm or erg/cm 3 

10‘ 3 

10 3 


tension 

7/meter 2 





a Calories are gram calories. All units in each group are arranged in the order of increasing magnitude of the unit. 
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Universal Constants and Defined Values 


Constant Value 

c speed of light = 2.997925 X 10 8 meter/sec. 

[l/c = 3.3356405 X 10" 9 sec/meter; c 2 = 8.98755 x 10 16 (meter/sec) 2 ; 
1/c 2 = 1.112650 x 10~ 17 (sec/meter) 2 ] 
g standard gravitational acceleration = 9.8067 meter/sec 2 

h Planck’s constant = 6.6256 x 10“ 34 (J)(sec) 

k R/N = Boltzmann constant — 1.38054 x 10“ 23 J/°K 

m e mass of electron = 9.1091 X 10' 31 kg 

m p mass of proton = 1.67252 X 10' 27 kg 

N Avogadro’s number = 6.023 x 10 26 molecules/kg mole 
Q charge on electron — 1.60210 x 10" 19 C 
R gas constant = 8314 J/(°K)(kg mole) 

R c ^ charge-to-mass ratio for electron = 1.75880 X 10 11 C/kg 

R cp change-to-mass ratio for proton = 9.5790 x 10 7 C/kg 

e permittivity of free space = 10 7 /47tc 2 
= 8.854 x 10' 12 (C) 2 /( meter) 2 (N) 

[4778 = 1.11265 X lO" 10 (C) 2 /(meter) 2 (N)] 


Properties of Standard Air 

Conditions 

Temperature, T = 25°C = 298°K = 77°F = 539°R 

Pressure, p = 1 atm = 1.013 X 10 6 dyn/cm 2 = 14.7 lbs/in. 2 
= 1.013 X 10 5 N/meter 2 

Molecular weight, M = 29.0 gm/gm mole = 29.0 kg/kg mole 
Properties at Above Conditions 

Density, p = 1.185 x 10“ 3 gm/cm 3 = 0.0740 lb/ft 3 
= 1.185 kg/meter 3 

Viscosity, /r = 1.840 X 10“ 4 poise = 1.840 X 10" 5 (kg)/(meter)(sec) 

Mean molecular velocity, u m = (SRTIttM) 1/2 = 46,600 cm/sec 
= 1530 ft/sec = 466.0 meters/sec 

Mean free path, A m (simple kinetic theory) = 3 fi/pu m = 0.1000 microns 

= 1.000 x 10“ 7 meters 

Smoluchowski flocculation function K s /k c ~ 4k773 fx = 2.980 X 10“ 10 cm 3 /sec 

= 1.052 x 10" 14 ft 3 /sec 
= 2.980 X 10" 16 m 3 /sec 
= 298.0 micron 3 /sec 
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Nomenclature 

All equations are given on a dimensionally consistent basis and can be used with any 
dimensionally consistent units. The illustrative units given in the following are based on 
the MKSA system, using the rational basis for electrical units. For the cgs-esu (irrational) 
system, the corresponding cgs-esu units would be used and the permittivity, e, would have 
a value ofl/(47r). Universal constants and defined values are represented by a symbol in 
Gothic (sans serif) or in bold face type. 


a Acceleration, meters/sec 2 
A Surface area of a bounded volume 
V , meters 2 

A b Ground area covered by cloud in 
time /, meters 2 

c Speed of light, 2.9979 x 10 8 
meters/sec 

c mass concentration, kg/meters 3 
c p Mass concentration of particles, 
kg/meters 3 

c P o Initial mass concentration of par¬ 
ticles, kg/meters 3 

( ct) c Critical dosage (kg)(sec)/(meters 3 ) 
# Capacitance, farads 
C D Drag coefficient, dimensionless 
d Differential of 

D Diameter; diameter of spherical 
cloud, meters 

D c Diameter of wire or cylinder, 
meters 

Dp Particle diameter, meters 
Dp f Diameter of floe, meters 
Dpi Diameter of particles of one size, 
meters 

D p2 Diameter of particles of another 
size, meters 

D P o Initial diameter of drop, meter 
D pS f a Apparent Stokes settling diameter 
of floe, meters 

{n ^ u,j[k c g(p p - p)]Y' 2 
B Diffusion coefficient, meters 2 /sec 
Diffusion coefficient for particles, 
of diameter D p , meters 2 /sec 
Dpi Diffusion coefficient for particles 
of diameter D pi , meters 2 /sec 
Bp 2 Diffusion coefficient for particles 
of diameter D p2 , meters 2 /sec 
e Natural logarithmic base, 2.718 * 

E Potential or potential difference, V 
Ei Ionization potential, V 


E ps Specific particle potential = & P ! 
Irrth D p , V 

& Potential gradient or field intensity, 
V/meter 

Field intensity at surface of wire or 
cylinder, V/meter 

^0 Uniform external field intensity or 
that at surface of an electrode, V/ 
meter 

<^ max Maximum field intensity, V/meter 
£ ps Specific particle surface gradient 
for particle of size D py V/meter, 
Splash D p 2 

&psi Specific particle surface gradient 
for particle of size D pu V/meter 
&ps 2 Specific particle surface gradient 
for particle of size D pZ , V/meter 
Maximum stable value of & ps , V/ 
meter 

P s opt Optimum value of specific particle 
surface gradient, V/meter 
f Frequency, cps 

f p Probability of an electron occupy¬ 
ing a given quantum energy level, 
dimensionless 
F Force, N 

F a Force of attraction or adhesion, N 
F e Force due to electric field, N 
F r Resisting force due to fluid drag, 
N 

F r Force of repulsion, N 
g Standard gravitational accelera¬ 
tion, 9.8067 meters/sec 2 
G Mass velocity, (kg)/(meter 2 )(sec) 
G de Mass deposition rate due to elec¬ 
trostatic forces, (kg)/(meter 2 )(sec) 
G dg Mass deposition rate due to gravity, 
(kg)/(meters 2 )(sec) 
h Planck’s constant — 

6.6256 x 10~ 34 (J)(sec) 
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j Current, amp 

j L Current per unit length (of wire or 
tube), amp/meter 

j A Current per unit area, amp/meter 2 
k Boltzmann constant = R/N = 
1.38054 x 10- 23 J/°K 
k c Stokes-Cunningham correction 
factor, dimensionless 
k em Correction factor on flocculation 
rate to allow for both particle 
charge and particle mean free path, 
dimensionless 

k d Distribution factor, dimensionless 
k F $ Factor relating attractive force to 
maximum field intensity for proxi¬ 
mate bodies of opposite charge, 
dimensionless 

k 5 Sticking factor, dimensionless 
k sA Factor to allow for inductive 
shifting of charges on proximate 
conducting, oppositely charged 
particles, dimensionless 
k S i% Factor to allow for proximity 
effects on capacitance between two 
charged bodies, dimensionless 
k s # Factor to obtain the maximum field 
intensity from the specific particle 
surface gradient of proximate 
charged bodies, dimensionless 
k sE Factor to obtain the maximum 
field intensity from the nominal 
voltage gradient between proxi¬ 
mate charged bodies, dimension¬ 
less 

k sR Factor to allow for inductive 
shifting of charges on proximate 
conducting particles of like charge, 
dimensionless 

K s Smolukowski flocculation co¬ 
efficient, meters 3 /sec 
K w Proportionality factor in equation 
for force of adhesion by van der 
Waal’s forces, N/meter 
K 5 Proportionality factor, C/meter 2 
K 6 # Proportionality factor, V/meter 
In Natural logarithm of 
L Distance between centers of two 
particles or bodies, meters 
L m Migration distance, meters 
L s Clearance between proximate sur¬ 
faces of two bodies, meters 


m Total mass of particles, kg 
m e Mass of electron == 9.1091 X 10~ 31 
kg 

m t Mass of ion, kg 
m p Mass of proton = 1.67252 X 10' 27 
kg 

m p Mass of particle of diameter D p , 
kg 

m pi Mass of particle of diameter D pli 
kg 

m p2 Mass of particles of diameter D pl 
kg 

m pf Mass of floe, kg 
M Molecular weight of molecule, kg/ 
kg mole 

Mi Molecular weight of ion, kg/ 
kg mole 

n Order of surface zonal harmonic, 
dimensionless 

ti e Concentration of electrons in 
space, electrons/meters 3 
t7 ( Ion concentration in space, ions/ 
meters 3 

n p Numberconcentration of particles, 
particles/meters 3 

n p i Number of particles of size D pU 
dimensionless 

N Avogadro’snumber = 6.023 x 10 26 
molecules/kg mole 

N m Parameter to allow for slip be¬ 
tween particles during collision, 
dimensionless (see Eq. 20) 

N me Parameter that measures effect of 
particle slip as corrected for par¬ 
ticle charge, dimensionless (see 
Eq. 19) 

N q Parameter that measures effect of 
charge on flocculation, dimension¬ 
less (see Eq. 18) 

N Rep Particle Reynolds number == 

D p Urplp, dimensionless 
p Pressure, N/meter 2 
p e Pressure due to electrostatic stress 
in a fluid, N/meter 2 
p p Equilibrium vapor pressure from 
particle of size D p , N/meter 2 
Pt Total pressure, N/meter 2 
p a Constraining pressure due to sur¬ 
face tension, N/meter 2 
Poo Vapor pressure from a flat surface, 
N/meter 2 
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Q Charge on the electron = 1.60210 

xio- 19 c 

21 Charge or charge at time t , C 
Magnitude of arbitrary point 
charge, C 

J &2 Magnitude of another arbitrary 
point charge, C 
Initial charge, C 

2t p Particle charge or charge on par¬ 
ticle of diameter D Pi C 
£ pl Charge on particle of diameter 
D P u C 

2tp2 Charge on particle of diameter 
Z> P 2,C 

2l pa Average charge on each of other 
particles, C 

2l ps Surface concentration of particle 
charge, =^ p /7rZ> p 2 , C/meter 2 
Surface charge concentration. 
C/meter 2 

R Universal gas constant = 8314 
(J)/(°K)(kg mole) 

R Radius, meters 

Resistivity of bulk material, 

(ft) (meter) 

R c Charge-to-mass ratio = £ P lm p > 
C/kg 

t time, sec 

t a Time for objects at edge of cloud to 
receive critical dosage of agent, 
sec 

t p5 Specific particle stopping time — 
k c p P Djl 18^, sec 
t R Relaxation time = 
e8 v , sec 

t sd Specific dilution time = fip P ! 

2/tccSj, c ps» sec 
T Absolute temperature, °K 
u Fluid velocity, meters/sec 
Hi rms velocity of ions = 

(3k77m,) 1/2 = mT/Mi) 112 , 
meters/sec 

u m Particle migration velocity, meters/ 
sec 

u m Mean molecular velocity = 

[8kr/(7rA//N)] 1/2 = [8RTI(rrM)] 1/2 
meters/sec 

u, Relative velocity between particle 
and fluid, meters/sec 


u R Radial outward velocity of ex¬ 
panding cloud, meters/sec 
V Volume, meters 3 
x Distance, meters 
u tg Terminal gravitational settling 
velocity of particle, meters/sec 
a First Townsend ionization co¬ 
efficient, meters' 1 

pi Ion mobility, (meters/sec)/(V/ 
meter) 

P P Mechanical particle mobility = 
uJF r , (meters/sec)/(N) 

18 pe Electrostatic particle mobility = 
uj&, (meters/sec)/(V/meter) 

8 Dielectric constant of medium 
surrounding particle, dimension¬ 
less 

8 P Dielectric constant of particle, 
dimensionless 

S v Dielectric constant of a volume, 
dimensionless 

Si Dielectric constant of one material, 
dimensionless 

8 2 Dielectric constant of another 
material, dimensionless 
e Permittivity of free space = (10 7 / 
47rc 2 ) — 8.854 x 10- 12 (C) 2 / 
(meter) 2 (N) 

A t Mean free path of ions, meters 
A m Mean free path of electrons, ions, 
or molecules, meters 
a Surface tension of liquid or surface 
energy of solid, N/meter or J/ 
meter 2 

p Fluid density, kg/meter 3 
p„ Particle density, kg/meter 3 
p g Density of pure gas phase, kg/ 
meter 3 

pi Absolute gas viscosity, (kg)/ 
(meter)(sec) or dekapoise 
£ Probability factor for producing 
ionizing photon, dimensionless 
£ Probability factor for producing 
photoelectrons, dimensionless 
r y Tensile strength of liquid, N/ 
meter 2 

<p Work function for surface, J 
<p F Fermi energy level, J 
<p T Quantum energy level, J 
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I. Introduction 


A. Mechanisms and Models 

For any physical system there exists a precise mathematical and physical 
representation of all of the phenomena that make up the system. A chemical 
reaction is no exception, since it does possess some true mechanism that is 
descriptive of every microscopic detail of the reaction. In practice, of course, 
a complete description of this mechanism cannot be obtained, and approxima¬ 
tions must be made. For example, for studies of a fundamental nature on 
heterogeneously catalyzed reaction systems, this may entail reference to some 
average property, such as an average pore size, an average size of a particle 
of metal dispersed on a support, an average adsorbed state, or an average 
mode of reaction. For a broader problem such as the modeling of the opera¬ 
tion of an entire plant, we must make more extensive approximations. For 
industrial reactions generally we are fortunate if we can characterize the 
reactor feed in terms of a limited number of major components, if we can 
precisely define the reaction kinetics of these major components in the 
industrial scale, and if we can evaluate the residence time distribution of the 
reactor. Consequently, we will often be concerned with models of the system 
that are at best quasimechanistic, simply because our reacting system is chosen 
on the basis of economics rather than simplicity or convenience. This review 
presents statistical methods for developing mathematical descriptions (either 
theoretical or empirical) of reacting systems. 

In contrast to the above allusion to a mechanism, the term model has 
been used to describe a wide range of chemical engineering subjects. Chemical 
engineering modeling activities in general can be thought of as belonging to 
one of three members of a nested set. The outermost member is plant model¬ 
ing. This entails the description of, for example, an entire refinery complex, 
taking into full account topics ranging from crude resources to geographical 
product distributions. Economic models of such a complex obviously require 
a knowledge of the behavior of the several processes making up the plant. 
Hence, modeling activity within the plant-wide modeling is required, that is, 
process modeling. Process models are capable of predicting the steady-state 
level of operation at the settings of the process variables, and perhaps the 
dynamic response of the process to any disturbance. Several process models 
must be included in the plant model. The development of the process model, 
in turn, cannot be carried out without recourse to phenomenological or 
mechanistic modeling. The mechanistic model must be capable of describing 
adequately the basic physical and chemical steps that take place within the 
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processes, under the conditions imposed by the settings of the process 
variables. 

Considerable unification of the concepts of plant modeling and process 
modeling has taken place in recent years, for example, as described in Refer¬ 
ences (F4) and (R3) for process and plant concepts, respectively. Attention 
has been devoted only intermittently, however, to the development of 
methodology for mechanistic modeling. For reaction-rate modeling, in 
particular, there was considerable activity in the 1930’s and 1940’s and again 
in the late 1950’s and 1960’s. The latter activity apparently resulted in large 
part from the introduction of computer techniques into chemical engineering 
technology. This review will be concerned specifically with methodology 
useful in the elucidation of such reaction rate models, treated artificially as 
two distinct stages. In the specification stage, the problem is to determine the 
appropriate functional form of a rate model that adequately describes the 
reaction. Although parameter estimates are often necessary to test the 
adequacy of any particular functional form, these estimates often need not 
be very precise. After a model has been specified, it is often advantageous to 
give particular attention to the estimation of the parameters within the 
adequate functional form. Once the functionality of the rate equation is 
known, the estimation of the parameter values is relatively straightforward. 

Although examples of the methodology will utilize entirely reaction rates 
or reactant concentrations, the procedures are equally valid for other model 
responses. They have been used, for example, with responses associated with 
catalyst deactivation and diffusional limitations as well as with copolymer 
reactivity ratios and average polymer molecular weights. 


B. Reaction-Rate Models 


In the model-specification stage, we concentrate upon two types of 
reaction-rate models, the power-function model 


r — kC A a C B h 

and the hyperbolic model exemplified by 

__ Cb 

1 + Ca + &b Cb 


( 1 ) 

( 2 ) 


Here, r is a reaction rate, C A and C B are measures of concentration, and the 
remaining terms represent parameters to be estimated. 

Equation (1), of course, is simply a representation of the law of mass 
action, stating that the reaction rate is proportional to a concentration driving 
force. Such an equation has been applied to many reacting systems, particul¬ 
arly those of considerable complexity. 
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Equation (2) is generally derived based upon some form of steady state 
hypothesis. Such models have been applied to gaseous reactions (Lindemann 
theory), to enzymatic reactions (Michaelis-Menten mechanism), and to 
gaseous reactions on solid surfaces (Langmuir-Hinshelwood mechanism). 
Considerable discussion of the derivation of these equations has been pre¬ 
sented elsewhere (L2). In general, we refer to this model type as a hyperbolic 
model or, in the case of solid catalyzed gaseous reactions, as a Hougen- 
Watson model (to remove restrictions implicit in a Langmuir-Hinshelwood 
mechanism). 

The use of any model more complex than Eq. (1) has frequently been 
criticized as an attempt to read too much into a set of kinetic data. The 
implicit results of such a misapplication would be a proliferation of meaning¬ 
less constants; this would thus lead to unnecessary effort in data fitting and 
interpretation, or to misleading interpolation or extrapolation. 

The latter danger is, of course, potentially present any time any data 
interpretation is attempted, particularly if nature is assumed always to follow 
Eq. (1). The only course of action is to attempt to include as much theory in 
the model as possible, and to confirm any substantial extrapolation by 
experiment. It is erroneous, however, to presume that kinetic data will always 
be so imprecise as to be misleading. The use of computers and statistical 
analyses for any linear or nonlinear reaction rate model allows rather definite 
statements about the amount of information obtained from a set of data. 
Hence, although imprecision in analyses may exist, it need not go unrecog¬ 
nized and perhaps become misleading. 

However, the amount of error in the data is not generally the limiting 
factor in data interpretation. Rather, the locations at which the data are 
taken most severely hinder progress toward a mechanistic model. Reference 
to Fig. 1 indicates that the decision between the dual- and single-site models 
would be quite difficult, even with very little error of measurement, if data 
are taken only in the 2- to 10-atm range. However, quite substantial error 
can be tolerated if the data lie above 15 atm total pressure (assuming data 
can be taken here). Techniques are presented that will seek out such critical 
experiments to be run (Section VII). 

The concept of the reaction-rate model should be considered to be more 
flexible than any mechanistically oriented view will allow. In particular, for 
any reacting system an entire spectrum of models is possible, each of which 
fits certain overlapping ranges of the experimental variables. This spectrum 
includes the purely empirical models, models accurately describing every 
detail of the reaction mechanism, and many models between these extremes. 
In most applications, we should proceed as far toward the theoretical extreme 
as is permitted by optimum use of our resources of time and money. For 
certain industrial applications, for example, the closer the model approaches 
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S0 2 Oxidation 
752° F 



Fig. 1. Initial rate dependence on total pressure for S0 2 oxidation (M3). 

the true mechanism, the better the final plant design and operation may 
become. For example, for a high-throughput process of which many exist 
and many more are expected to be built, a precise mechanistic model can 
often be justified. In this area, in fact, recent developments in computing, 
process dynamics, and large-system design are increasingly demanding more 
sophisticated models of the mechanistic and transport phenomena of the 
plant in general and reaction kinetics in particular. On the other hand, the 
construction of a precise mathematical model can require considerable 
experimentation and analysis. It would often be more profitable to get a new 
process on stream with a grossly nonoptimum design than to delay con¬ 
struction for a precise definition of a kinetic model. For most applications, an 
economic balance must be achieved to allow flexibility for modeling as well 
as rapid progress toward the process design. 

C. Integral and Differential Models 

Equations (1) and (2) represent reaction rates and, as such, can represent 
directly only data from a differential reactor. In many cases, however, data 
are obtained from an integral reactor. Are the data to be differentiated and 
compared directly to Eqs. (1) or (2), or are the equations to be integrated 
with the conservation equations and compared to the integral data? 

If the modeling process has entered the estimation stage, then Eqs. (1) 
or (2) should clearly be integrated and fitted to the integral data. This may be 
accomplished quite easily, through analytical or numerical integration of the 
conservation equations combined with one of the estimation procedures of 
Section III. General programs providing such combinations have been 
described (H6). 
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In the specification stage, however, gross elimination of rival models is 
difficult to accomplish using such generalized techniques with a set of existing 
data (cf. the example of Section III,A,3). Instead, careful comparisons of 
the features of the data and the theoretical model surface are required; these 
are most easily compared in the simplest possible model form, i.e., the 
differential form (see, for example, Section V). Furthermore, the experi¬ 
mental design procedures of Section VII can consume substantial computer 
time, a problem made more severe by iterative numerical integration. With 
the exception of purely empirical modeling (as in Section VI, which is always 
in the estimation stage), then, it is preferred to carry out the initial model 
specification with the simplest form of the model with which the data can be 
made consistent, For example, factors such as severe variation in catalyst-bed 
temperature can require the integrated equations even in the specification 
stage. 


II. Linearly Reducible Models 

Most of the modeling procedures commonly used require that the model 
first be reduced to a form which is linear in the unknown parameters. This 
procedure represents very good tactics; the technique will be exploited 
frequently in this review, particularly in Section V. If the scope of the models 
to be used or the range of experimental variables to be explored is not limited 
when applying this philosophy, the procedure also represents good modeling 
strategy. 

For these reasons, a proper balance must be achieved between the lineari¬ 
zation tactics and the overall modeling strategy. Both linear and nonlinear 
methods will be illustrated in the review, along with the problems encountered 
when relying too heavily on either single approach. First, however, some of 
the more common linear procedures will be discussed. 


A. Power-Function Models 

For a single reacting component, Eq. (1) reduces to a form that can easily 
be compared to rate data to determine (a) if the model is adequate and, if so, 
(b) the best estimates of the rate constant and the reaction order. For this 
case, Eq. (1) becomes 


r = - dCJdt = kC A a 


( 3 ) 


Here, t is a measure of reaction time. To analyze data from certain reactor 
types using this model, one can use one of the following methods. 
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1. The Method of Integration 

Here, the rate equation, such as Eq. (3), is integrated to yield 

C\- a -C\Z a = (a-i)kt a# 1 (4) 

In (C A IC A0 ) = -kt a = 1 

Now, appropriate plots of the data are made, which, if linear, would indicate 
that the assumed model of Eq. (3) is adequate. For example, if ln(C A /C A0 ) 
were linear with t, a first-order model would be adequate. Alternatively, 
one could assume a model (including the value of the parameter a), calculate 
the rate constant k at each data point, and tabulate the constants. If these 
“constants” remain constant, or if there is a reasonable trend of the con¬ 
stants with any independent variable, then the data do not reject the assumed 
model. For example, the value of In k would be expected to be independent 
of the value of the reaction time and to change linearly with the reciprocal 
of the absolute temperature. 

2. The Method of Differentiation 

In this method, the reaction rate is read directly, say, as the slope of the 
concentration-time data. Then, the logarithm of the rate is plotted versus 
the logarithm of the concentration; if the data lie along a straight line, the 
slope is equal to the reaction order. 

3. Fractional-Life Methods 

Models may also be tested by utilizing the time required for a given fraction 
of a reactant to disappear, since this varies with the initial concentration in a 
fashion characteristic of the reaction order. For example, if the half-life of a 
reaction is defined as the time required for one-half of the initial amount of 
reactant to be consumed, then Eq. (4) may be written 


I2 a ~ 1 — 1 

,n - ln U. -1) 

)-(« —l)ln C A o 

a # 1 

ti/2 = 0.693 Ik 


a = 1 


Here t l/2 is the half-life and C A0 the initial concentration of reactant. Now, 
half-life data (versus C A0 ) may be analyzed by the same techniques used in 
the method of integration. 

Several other methods are available for analyzing reaction-rate data, such 
as utilizing linear least squares in the above methods, or such as the method of 
dimensionless curves. The procedures and their advantages and disadvantages 
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have been adequately discussed elsewhere (L2); numerous examples of the 
techniques have also been set forth (B3, F5, H7, L2, W2) and are not discussed 
further here. 

4. Multicomponent Cases 

For the more usual multicomponent case, as described by Eq. (1), varia¬ 
tions of the above methods are used to allow a data analysis. One method 
is to take raction-rate data at such large concentrations of one reactant that 
this concentration is effectively a constant during the entire course of the 
reaction: 

r = kC/C b B0 - k'C A a (6) 

The order of the other reactant can be determined by any of the previously 
discussed methods. This technique, called the isolation method, will allow 
a determination of the component reaction orders, but it should be kept in 
mind that a very limited region of the experimental space has been covered 
in determining these orders. Thus, because the model has not been tested for 
conditions in which both concentrations are varying, the model should be 
used with caution here. 

A second method of analyzing data to be described by Eq. (1) is to use 
initial reaction rates. Here, one can vary the initial concentrations of the 
individual reactants (holding all other reactant concentrations constant), 
measure the rate at zero time for each case, and analyze the data by the 
previous methods. An advantage of this technique over the isolation method 
is that the concentrations of the reactants can be nearly equal instead of some 
concentrations being in large excess. The method allows reaction rates to be 
obtained over the entire range of composition with respect to known major 
reaction participants. However, these rates may not be equal to those ob¬ 
tained from experiments in which extensive conversions are allowed, due 
to the presence of trace byproducts generated during the reaction that affect 
reaction rate. 

For more complex reacting systems, the simple modeling procedures 
described above are not sufficient. In these more complex systems, the kinetic 
model describing the system will more generally consist of a group of coupled, 
nonlinear differential equations. For specific systems, standard analysis 
techniques are available (Wl, B5); more general procedures have also been 
made available through recent developments of computing hardware and 
software, and are discussed in the Sections III-V. One very real danger in the 
treatment of any such system is the inclusion of extraneous and unnecessary 
parameters, simply because the descriptive equations become so complex 
that the contributions of the parameters are hidden. Also, with such multi- 
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parameter systems, convergence of nonlinear estimation programs can be¬ 
come difficult. Techniques reducing such problems are described. 


B. Hyperbolic Models 


The hyperbolic model types have very commonly been used in the analysis 
of kinetic data, as discussed in Section I. Such applications are sometimes 
justified on the theoretical bases already alluded to, or simply because models 
of the form of Eq. (2) empirically describe the existing reaction-rate data. 
Considerably more complex models are quite possible under the Hougen- 
Watson formalism, however. For example, Rogers, Lih, and Hougen (Rl) 
have proposed the competitive-noncompetitive model 


r = 


71 * 1*2 PiP2 


+ 


y 2 *!*2 P\Pi 


(1 + K iPi + K 2 p 2 y (1 + K lPl )( 1 + K lPl + K 2 p 2 ) 


( 7 ) 


for cases in which large and small molecules react over a solid catalyst. 

Shabaker (SI), for the hydrogenation of propylene over a platinum 
alumina catalyst, selected a Hougen-Watson model of the form 


where 


r = K 2 L 2 (CJL) 3 Pl p 2 + k 2 K l L(CJL) 2 p 1 p 2 


Cl 

L 


(l + K 2 p 2 ) { 
4K lPl 


, (1 + K 2 p 2 ) 2 , 

' -4 s' ~r r 0 0 T* . 


1 


T 1/2 


16*iPi 2 K iPi , 


( 8 ) 


The adsorption and rate constants in each case exhibit an exponential 
temperature dependence. The study of these models, in addition to the 
linearizable models of the form of Eq. (2), have become possible through 
the use of nonlinear least squares (see Section III). 

The usual data analysis procedures for the linearizable models typified 
by Eq. (2) consist of (1) isolating a class of plausible rival models by means 
of plots of initial reaction-rate data as a function of total pressure, feed 
composition, conversion, or temperature; (2) fitting the models passing the 
screening requirements of the initial rates by linear least squares, and further 
rejecting models based upon physical grounds. 


1. Dependence of Reaction Rate on Experimental Variables 

The examination of the initial rate dependence upon total pressure is by 
far the most common means of examining the various classes of rival models 
in terms of their ability to fit the observed data. Yang and Hougen (Yl) have 
presented a classification of a multitude of possible models in terms of the 
dependence of the predicted rate upon total pressure. Generally speaking, the 
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analysis may be performed by writing each of the candidate models in terms 
of its dependence upon total pressure. For an alcohol dehydrogenation, for 
example, two models that might be considered are the dual site 

/cKjrfe -JapJK) 

(1 + K e p E + K a p A + K h p H ) 2 


and the single site 


kK E (p E ~~ Pa PhIK) 

(1 + K e Pe + K a Pa) 


( 10 ) 


For initial reaction rates r 0 with a pure alcohol feed, all product partial 
pressures are zero, and p AO = y A0 n, so that for the dual site 


kK E 7i 

r ° = (l +X £ 7T) 2 

and for the single site 

kK E n 

r ° = (T + M 


(ii) 


( 12 ) 


At low 7 i, the denominator simplifies to unity in each case and both models 
are linear in n. For sufficiently high n, the parenthesis in the denominator 
approaches K E n ; the initial rate for the dual-site model then approaches 
zero, and that of the single-site model approaches a constant value. Thus the 
plot of the experimental data will indicate that the dual-site model is prefer¬ 
able if a maximum exists in the data, or that the single-site model is prefer¬ 
able if a horizontal high-pressure asymptote exists. Flence, for the data of 
Franckaerts and Froment (FI) shown in Fig. 2, the dual-site model is pre¬ 
ferred over the single-site model. 

In addition to the maximum point, inflection points of the rate data can 
be used for testing model adequacy (M6). 

The more commonly applied procedure, however, is to linearize the models 
of Eqs. (11) and (12) to yield 



K e 

(kK E ) i/2 


71 


(13) 

(14) 


The data should lie along a straight line when plotted as (7i/r 0 ) 1/2 vs 7i, if the 
dual-site model is adequate. If, in addition, the dual-site model is to be pre¬ 
ferred over the single-site model, the plot of njr 0 vs n should be curved 



MATHEMATICAL MODELING OF CHEMICAL REACTIONS 


107 



Fig. 2. Initial reaction rate versus total pressure for alcohol dehydrogenation, 250 
and 285°C. 



Fig. 3. Linearized initial rate plots for alcohol dehydrogenation, 285°C. 

(cf., for example. Section V). The data of Fig. 3, replotted from Fig. 2 accord¬ 
ing to Eqs. (13) and (14), suggest that the dual-site model is adequate and is 
preferred over the single-site model. 

Often it is advisable to utilize the slopes and intercepts of such plots as 
well as the above described curvature of the data. Kabel and Johanson (Kl), 
for example, could eliminate an adsorption-controlling model with only 70% 
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Fig. 4. Linearized initial rate plot for alcohol dehydration, Eq, (16). 

certainty using the line curvature for the model representing dehydration of 
ethanol to ether over an ion-exchange-resin catalyst: 

^(Pa ~ PePw/K) 

1 + L(KaIK) 2 Pe PwY 12 + KyyPw 

For ten data points reported at varying alcohol partial pressures ( p A ) and 
zero water partial pressure ( p w ), this model becomes 

r 0 = kLp A (16) 

This suggests that a plot of r 0 vs p A (Fig. 4) or r 0 /p A vs p A should be 
linear. It is very difficult to reject this model on the basis of data curvature, 
even though it is evident that some curvature could exist in Fig. 4. However, 
Eq. (16) demands that Fig. 4 also exhibit a zero intercept. In fact, the 99.99% 
confidence interval on the intercept of a least-square line through the data 
does not contain zero. Hence the model could be rejected with 99.99% 
certainty. 

The dependence of the reaction rate upon conversion, temperature, and 
composition has not been used as widely as the pressure dependence in 
model screening. The conversion behavior is discussed in detail in Section V. 
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The utility of the overall dependence of the reaction rate upon temperature 
appears to be slight, perhaps in some cases allowing a discrimination between 
adsorption and surface reaction classes of models. The study of the residuals 
of various estimated parameters as a function of temperature can clearly 
indicate model inadequacies (Section IV) and, in some cases, can lead to 
model modifications correcting these model inadequacies (Section V). 

2. Magnitude of Estimated Parameters 

The second stage of this analysis involves the fitting of the model to the 
data and examination of the parameter values thus obtained to ensure their 
physical reasonableness. In particular, (a) the estimated adsorption and rate 
constants should be positive; (b) a plot of the logarithm of the rate constant 
with reciprocal absolute temperature should be linear with a negative slope; 

(c) a plot of the logarithm of the adsorption constant with reciprocal absolute 
temperature should be linear with a positive slope (exothermic adsorption) 
and generally a negative intercept (decrease in entropy with adsorption); 

(d) the model should adequately fit the data (Section IV) with the best 
estimates of the parameter values. These criteria should, of course, be used 
with caution. For example, it is possible that adsorption could be endother¬ 
mic if some of the adsorbed molecules dissociate. The use of these criteria 
has been discussed at length elsewhere (K7). Much of the present discussion 
is necessarily deferred until Section III. However, general cautionary mea¬ 
sures must include an examination of the confidence interval for any para¬ 
meter estimate before one rejects a given model. For example, it is illustrated 
in Section III that although an estimated adsorption constant is negative, 
its confidence region can easily contain positive values. Also, constants 
estimated to be negative by unweighted linear least squares can be estimated 
to be positive by nonlinear least squares or weighted linear least squares 
(Section III). The general utility of examining the magnitude of the estimated 
parameters can be illustrated by the data of Kabel and Johanson (Kl), which, 
at zero water partial pressure, can be shown to reject the single-site model 

— ~ PePw/K) 

(1 + KaPa + KwPw) 

The appropriate linear plot of p A 2 /r 0 vs p A is shown in Fig. 5. With the 
amount of scatter in the data for this plot, there is no justification for rejecting 
the model because of curvature of correlation in the locus of data points. 
However, the intercept of this graph suggests that the adsorption constant is 
significantly negative. If a model that possesses theoretical insight into the 
reacting system is desired, then this model must be rejected. 
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An example of a model rejection based upon temperature coefficients 
can be obtained from the single-site model considered by Ayen and Peters 
(A3) for the reduction of nitric oxide over a commercial catalyst: 

r _ fc^NO Pno Ph 2 
1 + ^noPno + &h 2 Ph 2 

This model was fitted to the data of all three temperature levels, 375, 400, 
and 425°C, simultaneously using nonlinear least squares. The parameters 
were required to be exponentially dependent upon temperature. Part of the 
results of this analysis (K6) are reported in Fig. 6. Note the positive tempera¬ 
ture coefficient of this nitric oxide adsorption constant, indicating an endo¬ 
thermic adsorption. Such behavior appears physically unrealistic if NO is not 
dissociated and if the confidence interval on this slope is relatively small. 
Ayen and Peters rejected this model also. 


111. Parameter Estimation 

In any modeling procedure, values of the predicted rate must approximate 
the values of the observed rate before an adequate model has been estab¬ 
lished. A simple indication of the comparative shapes of the predicted and 
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Fig. 6. Temperature dependence of adsorption constant for nitric oxide reduction, 
Eq. (18). 

observed rate surfaces can be obtained by procedures described in Section II. # 
For the final check of the agreement of the model and the data as well as for 
further use of the model, estimates of the parameters within the model must 
be obtained. Linear and nonlinear least squares have proven to be of signific¬ 
ant value for this purpose. For effective utilization of the parameter estimates 
thus obtained, the precision of the estimates should also be measured. 

A. Least Squares 

1. Linear Least Squares 

Parameter values producing predicted-model results near the observed 
results have frequently been selected by minimizing S : 

S = t(y u -h? (19) 

u= 1 

This method of least squares is not only intuitively desirable, but also provides 
estimates having desirable properties, if certain assumptions are met (D4). 
For models that are linear in the parameters, that is, models of the form 

i = l 


( 20 ) 
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the parameters may be estimated analytically by linear least squares. In using 
unweighted linear least squares, however, it is desirable that the x’s be free 
of error, that the response y have constant error variance, and that this error 
be independent and of mean zero. 

For a simple linear model 

P. = b o + M« (21) 

the least-squares estimates b 0 and b x may be obtained by substituting Eq. (21) 
into Eq. (19), differentiating, and setting the first derivatives to zero: 


r)S N 

x7-=-2l(y tt -b o -b lXu ) = 0 

VV 0 u=l 

dS N 

*r = - 2 1 x u(y u -b 0 - MJ = 0 

obi 11=1 

Solving these two equations in two unknowns, 
l w 

1_ ^EM-(Z^) 2 


( 22 ) 

(23) 


(24) 

(25) 


where the summations range from u = 1 to u = N. In the more general matrix 
notation, the least-squares estimates for Eq. (20) become 


b = (X 1 X) -1 X 1 Y 


(26) 


where b is a p x 1 column vector, Y is an N x 1 column vector, and X is an 
N x p matrix of the levels of x. For Eq. (21), 


b = 



(27) 


Y = 


y i 
y 2 


ly N A 

'1 


X = 


1 X 2 
1 x 3 


1.1 - T vJ 


(28) 


(29) 


The equations represented by Eq. (26) are termed the normal equations; they 
provide parameter estimates possessing the minimum distance from the 
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data Y to the model surface, thus representing a line that is perpendicular 
(or normal) from the point Y to the surface. 

Many kinetic equations can be suitably linearized to the form of Eq. (20). 
For example, Eq. (1) can be transformed logarithmically, or Eq. (2) can be 
transformed reciprocally. Two equations proposed for describing pentane- 
isomerization data (Cl, Jl) are the single site 


r fcK 2 (P2 - P3/X) 

(l + KiPi + K2P2 + K 3 p 3 ) 

and the dual site 


kK 2 (p 2 - pjK) 

(1 + K 2 Px + K 2 P 2 + K 3 p 3 ) 2 


(30) 


(31) 


where p u p 2 , and p 3 are the partial pressures of hydrogen, normal pentane, 
and isopentane. Rearranging and redefining parameters, 


~~ j = b o + b iPi + b 2 p 2 + b 3 p 3 (32) 

= b o + b 2 Pi + b 2 p 2 + b 3 p 3 (33) 


Unweighted linear least-squares estimates of these parameters are shown in 
Table I. The negative parameter estimates of the single-site model are of no 


TABLE I 

Parameter Estimates eor Pentane Isomerization 



Least-square technique 

Parameters 

Unweighted 

linear 

Weighted 

linear 

Unweighted 

nonlinear 

Single Site 

k 

189 

30.5 

36.2 

Ki 

-0.4 

2.0 

0.9 

k 2 

-0.03 

1.3 

0.5 

k 3 

-1.08 

4.3 

2.1 

Dual Site 

k 

289 

73.9 

133 

K t 

0.03 

0.03 

0.03 

k 2 

0.005 

0.02 

0.02 

k 3 

0.09 

0.07 

0.07 
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great concern, as they arise from a negative estimate of b 0 in Eq. (32) having 
a confidence interval (Section III,C) so large that positive values of b 0 may 
be chosen without significantly harming the fit of the data. 

Several variations of these concepts (e.g., stepwise regression) have also 
been proposed (D4). 

2. Weighted Linear Squares 

Thus far, we have assumed that all of the observations are independent 
and have the same inherent uncertainty. Deviations from such assumptions 
have been discussed in a generalized fashion elsewhere (H8). 

Taking into account deviations from constant variance is a particularly 
straightforward matter. If, for example, the variable y of Eq. (19) possesses a 
variance a- 2 which varies from data point to data point, then Eq. (19) divided 
by a 2 is the appropriate sum of squares to minimize. If, in general, we define 

~a 2 0 ••• 0 

c = o a 2 2 ••• 0 

0 0 ••• a N 2 

then Eq. (26) becomes 

b = (X 1 C -, X) -1 X , C -1 ^ 

or if p = Bx, this is simply 

c _£(*i.Kito 2 ) 

I (*<V) 

where the sum is taken over all observations. Now, if replication (Section IV) 
has been carried out at each data point, then the estimate of error variance 
at each data point, a 2 , can be inserted into Eq. (34) and the least-squares 
estimate calculated from Eq. (35). 

More typically, we have an indication that a transformed variable f(y) 
has constant error variance and will wish to use this information to weight 
y appropriately. For example, we may suspect logy has constant error 
variance and wish to fit y. More typically, we might feel that y has constant 
error-variance and wish to fit l/y. 

This problem was solved approximately in 1947 (B2, K9, Jl), wherein it 
was suggested that the transformed function of y be expanded in a linear 
Taylor series to provide 

variance [/(y)] = (Sf(y)ldy) 2 • variance (y) 


(34) 

(35) 

(36) 


(37) 
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For example, if the rate r is thought to have constant error variance and we 
wished to fit f(y) = {ajr) n as in Eq. (33), then the quantity 

*, 2 = l-nfiydlr t ]V (38) 

should be used in the weighting matrix C of Eq. (34) (the constants n and <r r 2 
are unnecessary). For comparison to the results of the unweighted least- 
squares analysis of the pentane isomerizations data just discussed, the second 
column of Table I should be examined. Here, the weighting function of Eq. (38) 
was used, assuming the rate r had constant error variance [in contrast to the 
function of r defined in Eqs. (32) and (33) in the unweighted case]. Note the 
significant shift in the weighted estimates from the unweighted estimates, 
including the positive single-site parameter estimates. Since the nonlinear 
least-squares results also assume r has constant error variance, there is a 
close correspondence in the weighted linear and the nonlinear results. 

Equations (34) and (35) can also be generalized to include correlation 
between observations. In such a case 

°n a\i ••• <*\n 

°21 a 22 ■■■ °2N 

2 2 2 
1 G N 2 ••• &NN 

where the afj(i # j) are the covariances between the observation errors. This 
variance-covariance matrix is again applied through Eq. (35). 

3. Nonlinear Least Squares 

Many of the models encountered in reaction modeling are not linear in 
the parameters, as was assumed previously through Eq. (20). Although the 
principles involved are very similar to those of the previous subsections, the 
parameter-estimation procedure must now be iteratively applied to a non¬ 
linear surface. This brings up numerous complications, such as initial esti¬ 
mates of parameters, efficiency and effectiveness of convergence algorithms, 
multiple minima in the least-squares surface, and poor surface conditioning. 

Iterative Techniques. In estimating parameters in a model that is non¬ 
linear in the parameters 

r=/(x;K) (40) 

we still minimize S of Eq. (19). Since the partial derivatives similar to Eqs. 
(22) and (23) do not generally lead to equations that are easily solved, we 
turn to one of three basic types of methods (K8), even though a wide variety 
of specific minimization techniques are available (S2). 
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In the linearization method , the nonlinear model of Eq. (40) is linearized 
by a truncated Taylor expansion: 


where 


yu = r u-f( x u'’K) = Y J f'iub i 
i — 1 


a/fr,; k> 

SKi 


K = K° 


(41) 


(42) 


Here K° is the set of initial parameter estimates. Since Eq. (41) is linear, 
estimates of the correction vector b may be obtained through Eq. (26), where 
y u is now given by Eq. (41) and X is given by 


x = {/;„} 


(43) 


Using estimates thus obtained, improved estimates of K { are given by 

K{ 1) = b t + K t ° (44) 


The procedure is repeated until the procedure converges, that is until the 
correction vector b approaches zero. Methods of modifying the size of the 
correction vector have been developed to improve convergence (B5) and this 
method, in theory, should always converge (HI). In practice, nonlinearities 
in the model and poor parameter estimates can prevent convergence. A 
modification of this method linearizes the normal equations; higher deriva¬ 
tives have also been used. 

In the steepest-descent method (BIO, B18, K8, S2, R2), the sums-of-squares 
surface is visualized to be a response surface in which the parameters are 
variables. Steepest-descent procedures are then applied to determine the 
minimum of the surface. The procedure is, then, to (a) select initial para¬ 
meter estimates; (b) set up a first-order design (B18) in the parameter space 
about these latter estimates, calculating the sum of squares at each point; 
(c) determine the direction of steepest descent; (d) proceed in this direction 
in the parameter space until the sums of squares begin increasing; (e) if the 
minimum is not near, go back to step (b); otherwise (f) set up a second-order 
design for precise location of the minimum, and continue toward the mini¬ 
mum, reparameterizing (Section III,B) as necessary to improve surface 
conditioning. Although this method converges for nearly any set of initial 
estimates, its convergence can be agonizingly slow for multiparameter 
problems (K8). 

Experience with fitting many models indicates that the steepest descent 
is very stable for the initial iterations, while the linearization method is 
more efficient for the final iterations. A compromise method has been suggested 
(L5, Ml, M2) which tends to emphasize steepest descent at the outset and 
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linearization in the final stages. In this method, the correction vector used 
in Eq. (44) is calculated from 

b = (X*X +^I)“ 1 X 1 Y (45) 

where y u is given by Eq. (41) and X l Y is the vector of steepest descent. Thus, 
a large value of \p makes the correction vector near the vector of steepest 
descent while a small \p makes the correction vector near that obtained by the 
linearization method. The compromise method, then, incorporates a con¬ 
tinually decreasing value of ^ as the iterations proceed. 

Numerous examples of applications of nonlinear least squares to kinetic- 
data analysis have been presented (K7, K8, L3, L4, M7, P2); an exhaustive 
tabulation of references would, at this point, approach 100 entries. Typical 
results of a nonlinear estimation and comparison to linear estimates are 
shown in Table I and discussed in Section III,A,2. Many estimation problems 
exist, however, as typified in part by Fig. 7. This is the sum-of-squares surface 
obtained at fixed values of K s and K u in the rate equation used for the cata¬ 
lytic hydrogenation of mixed isooctenes (M7) 


_ kKniKuPH Pu _ 

(1 + KhPh + KvPu + K s p s ) 2 


(46) 



Fig. 7. Contours of sums of squares of residual rates for isooctene hydrogenation, 
Eq. (46). 
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First, this figure shows long skewed contours that frequently occur with 
such models. Also, it should be noted that two minima exist in this plot, 
in the first and third quadrants. Accordingly, in the estimation of the rate 
parameters for this model with the standard nonlinear estimation techniques, 
it would be possible to obtain certain negative parameter estimates. In fact, 
it can be seen from Eq. (46) that any two of the parameters k, K H , or K v 
can simultaneously be negative without greatly hindering the fit of the equa¬ 
tion to the data owing to the fact that they are multiplied together in the 
numerator. These considerations are of importance in the estimation of the 
parameter values, since negative estimates of any adsorption constants are 
generally used as a justification for rejecting the model under consideration. 
Therefore, one should attempt to examine the sums-of-squares surface in 
enough detail to ensure that the model should truly be rejected and not just 
refitted. Measures of nonlinearity of the sum-of-squares surface are available 
(B3, B8, M2, M7). 

Part of the problems encountered with sums-of-squares surfaces such as 
shown in Fig. 7 can be relieved by obtaining precise initial-parameter esti¬ 
mates for the nonlinear estimation. Employing a pilot or first-stage estimation 
procedure to obtain values K° to use in an iterative nonlinear estimation 
program is often more efficient than simply making rough guesses, even 
though at first sight it may appear to be more time-consuming. Numerous 
examples of this type of pilot estimation have been described (K8). Figure 8, 
for example, presents the sums-of-squares surface for alcohol dehydration 
data and the model of Eq. (47). As indicated in the figure, any trial value of 
the two parameters in the first quadrant will, if restricted to the first quadrant, 
cause the nonlinear estimation procedure to diverge (M7): 

r = kK A(PA 2 ~ PePwIK) (4?) 

(1 + K A p A + K E p E + KwPw) 


A brief survey of the sums-of-squares surface, such as shown in this figure, 
can eliminate many computer hours of fruitless iteration with repeated initial 
parameter estimates. Other problems related to estimation from Figs. 7 and 
8 can be reduced by reparameterization, as discussed in Section III,B. 

If the nonlinear estimation procedure is carefully applied, a minimum in 
the sums-of-squares surface can usually be achieved. However, because of 
the fitting flexibility generally obtainable with these nonlinear models, it is 
seldom advantageous to fit a large number of models to a set of data and to 
try to eliminate inadequate models on the basis of lack of fit (see Section IV). 
For example, thirty models were fitted to the alcohol dehydration data just 
discussed (K2). As is evident from the residual mean squares of Table II, 
approximately two-thirds of the models exhibit an acceptable fit of the data 
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Fig. 8. Contours of sums of squares of residual rates for alcohol dehydration, Eq. (47). 

points used (rms ^ 1.2 x 10~ 6 ). The experimental design, as is shown later, 
plays a major role in determining the success of such a project (Section VII). 

For fitting such a set of existing data, a much more reasonable approach 
has been used (P2). For the naphthalene oxidation system, major reactants 
and products are symbolized in Table III. In this table, letters in bold type 
represent species for which data were used in estimating the frequency factors 
and activation energies contained in the body of the table. Note that the rate 
equations have been reparameterized (Section III,B) to allow a better esti¬ 
mation of the two parameters. For the first entry of the table, then, a model 
involving only the first-order decomposition of naphthalene to phthalic 
anhydride and naphthoquinone was assumed. The parameter estimates 
obtained by a nonlinear-least-squares fit of these data, 0, are seen to be 
relatively precise when compared to the standard errors of these estimates, s 9 . 
The residual mean square, using these best parameter estimates, is contained 
in the last column of the table. This quantity should estimate the variance 
of the experimental error if the model adequately fits the data (Section IV). 
The remainder of Table III, then, presents similar results for increasingly 
complex models, each of which entails several first-order decompositions. 



TABLE II : Summary of Parameter Estimates for Alcohol Dehydration 


Model 

No. 

k x 10* 

k a 

k e 

K w 

K e 

K w i- 

K a i- 

L 

Residual mean 
square X 10 10 

1 

1.49 

0 

0 

4.09 

.. 

_ 

_ 

_ 

1.09 

2 

2.26 

3.39 

0.13 

7.31 

— 

— 

— 

— 

0.70 

3 

8.63 

10- 8 

— 

3.68 

io- 8 

— 

— 

0.17 

1.13 

4 

5.28 

46.5 

— 

91.4 

666 

— 

— 

0.51 

0.75 

5 

8.98 

0 

0 

— 

— 

40.3 

— 

0.17 

1.36 

6 

9.72 

0 

0.19 

2,71 

io- 7 

— 

— 

0.15 

1.67 

7 

16.2 

4.26 

0.04 

1.34 

10" 6 

— 

— 

0.08 

6.71 

8 

15.8 

0 

0 

8.40 

— 

8.40 

— 

0.10 

1.45 

9 

5.0 

54.3 

0 

4.0 

— 

10 3 

— 

0.50 

1.45 

10 

1.44 

0 

0 

4.51 

—. 

— 

— 

— 

1.05 

11 

7.04 

2.89 

0 

6.30 

— 

— 

— 

— 

0.70 

12 

1.48 

10 4 

0 

10 4 

— 

— 

— 

— 

1.12 

13 

7.6 

0 

— 

4.7 

IO" 7 

— 

— 

0.19 

1.08 

14 

56.0 

15.4 

— 

30.6 

52.9 

— 

— 

0.64 

0.77 

15 

8.9 

0 

0 

— 

— 

45.7 

— 

0.17 

1.29 

16 

197 

4.4 

0 

— 

— 

253 

— 

0.21 

0.61 

17 

5.8 

0 

0.01 

4.03 

IO" 5 

— 

— 

0.25 

1.16 

18 

2.77 

2.74 

2.98 

4.71 

IO" 5 

— 

— 

0.06 

78.0 

19 

8.16 

0 

0 

0.44 

— 

IO" 8 

— 

0.14 

5.75 

20 

9.95 

3.69 

0 

2,01 

— 

16.6 

— 

0.90 

0.63 

21 

16.6 

— 

0 

1.86 

— 

— 

10~ 3 

0.30 

1.06 

22 

9.94 

— 

0 

10 4 

— 

— 

10 9 

0.15 

1.15 

23 

34.4 

— 

— 

1.74 

0 

— 

0 

0.21 

35.3 

24 

9.94 

— 

— 

10 4 

0 

— 

109 

0.15 

1.15 

25 

6.5 

— 

0 

— 

— 

5.4 

0 

0.48 

1.12 

26 

17.4 

— 

0 

— 

— 

0 

8.51 

0.17 

5.7 

27 

38.1 

— 

0.05 

1.03 

IO" 6 

— 

0 

0.19 

1.50 

28 

17.4 

— 

0 

0 

10“ 9 

— 

8.49 

0.17 

5.9 

29 

9.4 

— 

0 

1.15 

— 

0.90 

0 

0.40 

1.1 

30 

11.0 

— 

0 

10 5 

— 

0 

IO 10 

0.13 

1.2 
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By slowly increasing the complexity of the models in this fashion, it was hoped 
that a model could be obtained that was just sufficiently complex to allow an 
adequate fit of the data. This conscious attempt to select a model that satisfies 
the criteria of adequate data representation and of minimum number of 
parameters has been called the principle of parsimonious parameterization. 
It can be seen from the table that the residual mean squares progressively 
decrease until entry 4. Then, in spite of the increased model complexity 
and increased number of parameters, a better fit of the data is not obtained. 
If the reaction order for the naphthalene decomposition is estimated, as in 
entry 5, the estimate is not incompatible with the unity order of entry 4. 
If an additional step is added as in entry 6, no improvement of fit is obtained. 
Furthermore, the estimated parameter for that step is negative and poorly 
defined. Entry 7 shows yet another model that is compatible with the data. 
If further discrimination between these two remaining rival models is desired, 
additional experiments must be conducted, for example, by using the model 
discrimination designs discussed later. The critical experiments necessary 
for this discrimination are by no means obvious (see Section VII). 


B. Reparameterization 

The problems described in the last subsection can frequently be reduced 
by improving the surface conditioning through reparameterization. For 
example, if the two parameters in the first-order model 

C B = exp {-* 0 exp(—£/RT)} (48) 

are estimated from a set of data (H9), then one contour of the sums-of- 
squares surface can be represented as shown in Fig. 9. With such “ natural ” 
parameterization, an iterative nonlinear least-squares routine will generally 
converge to the minimum sum of squares quite slowly or not at all. However, 
writing the model in an equivalent mathematical form, but with different 
parameters, can lead to better sums-of-squares surface conditioning. 

Considerable discussion of reparameterization and examples of its useful¬ 
ness have been published (B3, B8, B12, Gl, G2, M7). Although several 
specific techniques are useful, one reparametrization of kinetic models often 
necessary is a redefinition of the independent variables so that the center 
of the new coordinate system is near that of the experimental design. In 
particular, the exponential parameter 


k = k 0 exp(—E/RT) 


(49) 



K> 


TABLE III 

Summary of Nonlinear Fitting of Naphthalene Oxidation Data 


k 0 = k Q exp (-EIRT) E' = EjR 

s 2 X 

Entry Mechanism" 12 3456 123 45 6a 10 3 

1 N— l^P S 8.79 0.842 — — — — 14.5 12.4 — — — — — 1.06 



O' 

t 

2 

S B 

0.26 

0.162 

— 

— 

— 

2 

A—l^P 

$ 

9.23 

0.873 

0.833 

_ 

_ 


N~ 2^Q 

N— 3^G 

SB 

0.26 

0.146 

0.151 

— 

— 

3 

N—1-+P 
f 4 

§ 

8.10 

2.16 

0.882 

18.8 

— 


N—2^Q 
JV— 3-+G 

SB 

0.61 

0.59 

0.142 

7.8 


4 

N —1 -*p—5- 

f 4 

>M 6 

8.10 

3.00 

0.881 

28.0 

0.831 


A—2-^Q 

A—3-*G 

SB 

0.73 

0.74 

0.124 

8.9 

0.221 


0.4 2.8 — — — — — 

14.6 12.4 9.68 — — — — 0.772 

0.4 2.8 2.34 — — — — 


15.0 

11.9 

9.80 

7.41 

— 

— 

— 

0.671 

1.0 

3.2 

2.09 

5.60 

— 

— 

— 


15.6 

13.0 

8.86 

9.69 

16.8 

— 

— 

0.471 

1.1 

2.6 

3.96 

.80 

3.5 

_ 

_ 
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5 


0.475 


TV— 1-vP—5^M 0 8.26 3.55 0.926 31.3 0.750 — 15.9 13.5 9.93 17.6 9.05 — 1.13 

f 4 

TV—2-*Q 5g 0.88 1.11 0.142 10.4 0.236 — 1.3 2.8 1.85 3.9 3.8 — 0.15 

TV—3->G 


6 

TV—1-*P—5^M 
f 4 

§ 

8.09 

2.76 

1.20 

29.2 

0.861 

-4.15 

15.6 

12.8 

11.1 

16.6 

8.84 

10.9 

— 0.480 


TV—2->Q 

ss 

0.72 

0.74 

0.48 

9.0 

0.227 

6.10 

1.1 

2.7 

3.2 

3.6 

3.90 

12.1 

— 


16 


TV—3->G 


7 

TV—1->P—5->M 
f 4 

6 

7.62 

4.16 

— 32.0 

0.787 

10.7 

15.4 

12.6 — 

9.60 

17.8 

3.58 

— 0.497 


TV—2—Q—6->G 

Sd 

1.05 

1.10 

— 13.1 

0.234 

1.9 

1.5 

2.5 — 

4.64 

3.9 

2.33 

— 


N, naphthalene; P, phthalic anhydride; M, maleic anhydride; Q, naphthoquinone; G, off gases. 
6 oc denotes reaction order for naphthalene decomposition steps. 
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E x 10 4 , cal/gm mole 

Fig. 9. Approximate 95% confidence region for first-order decomposition model. 


should almost invariably be redefined as 


where 


*-C.’exp{-f(i-±y 


k 0 ' = k 0 exp (-E/RT) 


(50) 

(51) 


In the example above, the sums-of-squares surface is transformed to that 
shown by Fig. 10. The best point estimates of the parameters of k Q ' and E 
are thus more readily obtained than k 0 and E; the initial-parameter estimates 
are less critical, and the estimation routine converges more rapidly to the 
minimum. Although the correlation between the parameter estimates has 
been reduced by this reparameterization, the size of the confidence region of 
the original parameters k 0 and E will not change. 


C. Confidence Intervals and Regions 

In reaction-rate modeling, precise parameter estimates are nearly as 
essential as the determination of the adequate functional form of the model. 
For example, in spite of imprecisely determined parameters, an adequate 
model will still predict the data well over the range that the data are taken, 
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Fig. 10. Approximate 95 % confidence region for reparameterized first-order decom¬ 
position model. 

as shown in Fig. 11 (Kll). However, if these parameter estimates are to be 
subjected to mechanistic interpretation or are to be used in reactor designs 
(with slight extrapolation), then clearly additional information about the 
precision of the estimates is required. An example of the first situation is 
provided by the negative parameter estimates of Table I. As is shown clearly 
by the confidence region for this entry, positive parameter estimates are quite 
compatible with the data (K3). 

The amount of uncertainty in parameter estimates obtained for the 
hyperbolic models is particularly large. It has been pointed out, for example, 
that parameter estimates obtained for hyperbolic models are usually highly 
correlated and of low precision (B16). Also, the number of parameters 
contained in such models can be too great for the range of the experimental 
data (W3). Quantitative measures of the precision of parameter estimates 
are thus particularly important for the hyperbolic models. (C2). 

1. Linear Models 

For models of the form of Eqs. (32) and (33) [or, more generally, Eq. 
(20)], the (1 — a) 100% confidence interval for the parameters b t (not K t ) 
is given by 

b, ± K, liWbdV 2 ( 52 ) 
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Fig. 11. Dependence of predicted reaction rate upon assumed parameter estimates, 
Eq. (149). 

where t vi a/ 2 is the (1 — a/ 2 ) 100% point of the t distribution (Dl) with v 
degrees of freedom. If the errors in the dependent variable [e.g., left-hand 
side of Eq. (32)] are normally and independently distributed with constant 
variance a 2 , then v{b t ) is the ith diagonal element of 

v(b) = (X'xrV (53) 

If, on the other hand, weighted linear least squares is required, Eq. (53) 
becomes 

Kb) - (X'CT'xr'X'CrV (54) 

The use of these equations allow the calculation of confidence intervals to 
assess, for example, the significance of negative parameter estimates or the 
additional experimentation required to estimate the parameters with a 
particular precision. 

As has been described, the correlation of the parameter estimates is also 
of great interest. This correlation is measured by the olf-diagonal terms of 
r(b), completely ignored by the above equations. If an analysis of variance is 
conducted, one finds that a joint (1 — a) 100% confidence region is defined by 

(P — b) 1 X 1 X(P — b) = s 2 pF a (p, v) (55) 

Where P is a vector of parameter values being estimated by b, s 2 is an in¬ 
dependent estimate of the experimental error variance a 2 , p is the number of 
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parameters, and F a (p,v ) is the (1 — a) 100% point of the F distribution. 
Alternatively, 

So = Smin + s 2 pF a (p, v) (56) 


where S c is the value of the sum of squares on the 100 (1 - oc)% confidence 
contour, and S min is the minimum value of Eq. (19). As is described in Section 
IV, s 2 can be estimated from replicated data or, if the fitted model is adequate, 
from 

s 2 = S m J(N-p) (57) 

For two parameters in the model, Eq. (55) simplifies to 


(01 — ^l) 2 X x i "h 2(01 — ^l)(02 ” ^2) Yj X i X 2 + (02 “ ^ 2 ) 2 X X 2 2 

= s 2 pF„(p, v) (58) 

where the summations range over N observations. For fixed-data x i9 this 
equation describes an ellipse in the parameter space. Thus, confidence regions 
are elliptical (or more generally, ellipsoidal) for linear mo'dels. 

The covariance (off-diagonal) terms of Eq. (53) are also needed to cal¬ 
culate approximate intervals for parameters in the hyperbolic models. 
For a model such as 


1 

r 


1 K c K d 

l + T Pc + T Pd 


(59) 


Eq. (53) provides the variance-covariance matrix for the parameter ratios. 
Letting b u b 2 , and b 2 denote the respective ratios of Eq. (59), then Eq. (37) 
may be applied to find 

v(k) = k*v(b i) 

where v(b^) is available from the usual linear least squares Eq. (53). Using a 
generalized form of Eq. (37), we further obtain 

»(« - (Q V,) + (!)V) + (!) (!) cov (>., w (60) 

**»> - (!f) 2 '<« ++ (t) (ff) « <61) 

Confidence intervals thus calculated for the rate and adsorption constants 
have been reported (K7). 


2. Nonlinear Models 

The general approach used with nonlinear models, such as Eq. (40) is to 
linearize by a Taylor expansion [Eq. (41)] and apply the linear theory of 
Section III,C,1. 

The more approximate method of measuring the precision of parameter 
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estimates is to use confidence intervals, as defined by Eqs. (52) and (53). 
Here X is again given by Eq. (43). This estimate will be erroneous because 
the Taylor expansion does not perfectly describe the nonlinear model 
throughout the confidence intervals. Hence, although the intervals are 
calculated assuming the contours of the sums-of-squares surface are ellip¬ 
soidal, they deviate from this shape. An example comparing such contours 
is shown in Fig. 12, for estimates of the two rate constants in the sequence 
A -> B -► C (B14). In fact, the deviation of the actual sums of squares from an 
ellipsoidal shape can be used as a measure of the nonlinearity of the model 
(B3, B8), using latent roots of the (X*X) matrix or other indicators (Gl, 
G2, M7). 

A better measure of the precision of the parameter estimates for nonlinear 
models is provided by Eq. (56), which can take into account deviations of 
the sums-of-squares contours from the ellipsoidal shape. In fact, this equation 
does provide a rigorous confidence contour for nonlinear models; it is only 
the exact confidence level, a, associated with this contour that is approximated. 

Numerous applications of this theory have been made in calculating 
confidence intervals for parameter estimates in nonlinear kinetic models, such 
as typified in Table III (P2). The use of confidence regions is typified in 
Fig. 13 (M7) for the alcohol dehydration model 



2 3 A 5 6 7 89 


Fig. 12. Contours of actual sum of squares surface and of corresponding linear theory 
surface, A —+ B C. 
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and Fig. 32 for the nitric oxide reduction model of Section VII. Note the 
large volume of these regions and the long skewed relationship to the axes 
(high correlation of estimates). 

D. Multiple-Response Parameter Estimation 

Parameter estimation through Eq. (19) has been widely used for experi¬ 
mental situations in which only a single response is being measured. Fre¬ 
quently, however, measurements can be made on two or more responses 



Fig. 13. Approximate 95% confidence region for alcohol dehydration, Eq. (62). 
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simultaneously. For example, Eq. (19) could be used if only the concentration 
of B could be measured in the reaction sequence 

A->B-+ C (63) 


If independent data were available for all three responses, however, all of 
these data should be used in estimating the two rate constants. One obvious 
extension of Eq. (19) to this case is to choose parameter estimates 
minimizing 

S r = Z ( C A i ~ C Ai ) 2 + Z (Cjh — C Bi ) 2 + £ (C ci — C Ci ) 2 (64) 

i -1 i=l 1=1 

This has been applied previously (Bl). 

This criterion, and others, can be derived using maximum likelihood 
arguments (H8). It has been shown that Eq. (64) is applicable (a) when each 
of the responses has normally distributed error; (b) when the data on each 
response are equally precise, and (c) when there is no correlation between 
the measurements of the three responses. These assumptions are rather 
restrictive. 

A much more general criterion has been developed (B12, H8) for which 
these assumptions have been relaxed. The criterion developed when fitting 
a model, r , to observations, y, and for v responses is to select parameter 
values minimizing the determinant 




E (y« - r u) 2 E (y« 2 - r « 2 )(y« - r u )• • • E (y« v - OOV-O 
E ( y« - r u)(y 2 - r u 2 ) E (y* 2 - r2 ) 2 ■ • • E (y« v - r » v ](y « 2 - r «) 

E 6V -rM - C) E Gu 2 - r« 2 )(y u v - O • • • E W - O 2 


(65) 


where all sums range from u = 1 to u = N observations. For the three- 
response example above, v = 3, and the parameters would best be estimated 
by minimizing the 3 x 3 determinant of Eq. (65). 

The advantages to be gained by following such a procedure are quite 
striking. The great improvement in the size of the confidence region for the 
two rate constants (reparameterized logarithmically) of Eq. (63) is shown in 
Fig. 14 (B12), with the measured responses reproduced thereon. Such precision 
of parameter estimates not only greatly improves the efficiency of the estima¬ 
tion procedure, but it can also assist in providing hard-to-obtain agreement 
between thermodynamic and kinetic estimations of kinetic parameters (M4). 
An application of the procedure is discussed in detail elsewhere (M4). 
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Fig. 14. Approximate 99.75 % confidence regions for A —B —^ C. 


!V. Tests of Model Adequacy 

The several modeling methods discussed in the accompanying sections 
are quite useful in testing the ability of a model to fit a particular set of data. 
These methods do not, however, supplant the more conventional tests of 
model adequacy of classical statistical theory, i.e,, the analysis of variance 
and tests of residuals. 

A. Analysis of Variance 

The analysis of variance is used to compare the amount of variability 
of the differences of predicted and experimental rates with the amount of 
variability in the data itself. By such comparisons, the experimenter is able 
to determine (a) whether the overall model is adequate and (b) whether every 
portion of the model under consideration is necessary. 

1. Significance of the Overall Regression 

For every set of reaction-rate data, a total amount of variability in the 
data may be expressed as 
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Also, by a least-squares analysis or some other suitable means, estimates of 
the parameters within a proposed model may be obtained. This allows the 
calculation of predicted reaction rates at each experimental point and thus 
an assessment of the total amount of variability which can be accounted for 
by the proposed model 

N 

E ', 2 

(=1 

The difference in the predicted and observed rates (y t - r t ), is termed a 
residual and is a measure of the inability of the model to describe exactly 
the experimental data. If the model is entirely correct, in fact, the residual 
will be a measure of experimental error. A measure of the total amount of 
variation unaccounted for by the proposed model, then, is 

£ Oi - r ( ) 2 

i= 1 

In statistical terminology, these quantities are called the crude sum of squares, 

N 

I * 2 

*= 1 

the sum of squares due to regression, 

N 

£n 2 

i= 1 

and the residual sum of squares, 

E O'! - r i ) 2 

i— 1 

It is a direct result of the orthogonality property of linear least squares 
(Section III,A,1) that the crude sum of squares may be broken into two parts 
as follows: 

E y? = E r i 2 + E (yt - r i ) 2 ( 66 ) 

i - 1 i - 1 i- 1 

Associated with each data point is a certain degree of freedom, which will be 
used to attribute more information to, say, 500 data points than to 5 data 
points. In particular, if N data points are used, the total sum of squares is 
said to possess N degrees of freedom. The predicted rates estimated from a 
model containing p parameters have p degrees of freedom, and the remaining 
N — p degrees of freedom are possessed by the residual sum of squares. 

If several data points have been taken at the same settings of the indepen¬ 
dent variables, i.e., replicated data are available, then a knowledge of the 
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inherent amount of error in the data is measured by the pure-error sum 
squares, 

N 


£ O'. - y) 2 - 

i = 1 


of 


Here , y is the average of all of the replicated data points. If the residual sum 
of squares is the amount of variation in the data as seen by the model, and 
the pure-error of squares is the true measure of error in the data, then the 
inability of the model to fit the data is given by the difference between these 
two quantities. That is, the lack-of-fit sum of squares is given by 

£ (y - rd 2 = £(y, - r,) 2 - £ (y, - y ) 2 (67) 

i=l 1=1 * — 1 


If there are n replications at q different settings of the independent variables, 
then the pure-error sum of squares is said to possess (n — 1) degrees of 
freedom (1 degree of freedom being used to estimate y ); while the lack-of-fit 
sum of squares is said to possess N — p — q(n — 1) degrees of freedom, i.e., 
the difference between the degrees of freedom of the residual sum of squares 
and the pure-error sum of squares. 

The sums of squares of the individual items discussed above divided by 
its degrees of freedom are termed mean squares. Regardless of the validity 
of the model, a pure-error mean square is a measure of the experimental error 
variance. A test of whether a model is grossly adequate, then, can be made 
by acertaining the ratio of the lack-of-fit mean square to the pure-error mean 
square; if this ratio is very large, it suggests that the model inadequately fits 
the data. Since an F statistic is defined as the ratio of sum of squares of 
independent normal deviates, the test of inadequacy can frequently be stated 


lack-of-fit mean square 
pure-error mean square 


>F a [N-p-t ?(«-!), q(n- «] 


( 68 ) 


The F statistic is tabulated in many reference texts (Dl). More rigorous 
discussions of the analysis of variance are, of course, available (D4). 

One important application of analysis of variance is in the fitting of 
empirical models to reaction-rate data (cf. Section VI). For the model below, 
the analysis of variance for data on the vapor-phase isomerization of normal 
to isopentane over a supported metal catalyst (Cl) 

r = b 0 + + b 2 x 2 + 6 3 * 3 + 6 1A * 2 + b 22 x 2 2 + 6 33 * 3 2 

+ 612*1*2 + 6 13 *iX 3 + 623*2*3 ( 69 ) 


is reported in Table IV. Here, * t is the hydrogen partial pressure, * 2 the 
normal pentane partial pressure, and x 3 the isopentane partial pressure. It is 
evident from the F ratios of Table IV that the model adequately fits these 
data. 
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TABLE IV 

Analysis of Variance for Eq. (69) 



Sum of 

Degrees of 

Mean 


F statistics 

Source 

squares 

freedom 

squares 

Ratios 

95% 

97.5% 

Crude 

640.49 

24 

_ 

_ 

_ 

_ 

Due to regression 

637.35 

10 

63.74 

— 

— 

— 

Residual 

3.14 

14 

0.225 

— 

— 

— 

Lack of fit 

2.85 

11 

0.260 N 

) 2.73 

8.8 

14.3 

Pure error 

0.29 

3 

0.0954 7 





An analysis of variance can also be used to test the adequacy of more 
theoretical models. For example, two models considered in Section III for 
pentane isomerization are the single-site and dual-site models of Eqs. (30) 
and (31). These were linearized to provide Eqs. (32) and (33). The overall fit 
of these equations to the data may now be judged by an analysis of variance, 
reported in Tables V and VI (K3). It is seen that Eq. (33) fits the data quite 

TABLE V 


Analysis of Variance for Eq. (32) 



Sum of 

Degrees of 

Mean 

F statistics 

Source 

squares 

freedom 

squares Ratios 

95% 

97.5% 

Crude 

28894.02 

24 

_ _ 

_ 

— 

Due to regression 

26896.82 

4 

6,724.2 — 

— 

— 

Residual 

1997.20 

20 

99.86 — 

— 

— 

Lack of fit 

1964.56 

17 

115.56 v. 






y 10.62 

8.7 

14.3 

Pure error 

32.65 

3 

10.88 ' 





TABLE VI 




Analysis of Variance for Eq. (33) 




Sum of 

Degrees of 

Mean 

F statistics 

Source 

squares 

freedom 

squares Ratios 

95% 

97.5% 

Crude 

763.54 

24 

. . 

_ 

_ 

Due to regression 

754.29 

4 

188.6 — 

— 

— 

Residual 

9.25 

20 

0.462 — 

— 

— 

Lack of fit 

8.98 

17 

0.528 v 

> 5-9 
0.090 / 

8.7 

14.3 

Pure error 

0.27 

3 
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well. However, there is some question as to whether Eq. (32) adequately 
represents the data, since the ratio of lack-of-fit to pure-error mean square 
is relatively large (although not so large as to allow a clear rejection of the 
model, particularly since the y may not be independent normal deviates). 

In some cases when estimates of the pure-error mean square are unavail¬ 
able owing to lack of replicated data, more approximate methods of testing 
lack of fit may be used. Here, quadratic terms would be added to the models 
of Eqs. (32) and (33), the complete model would be fitted to the data, and a 
residual mean square calculated. Assuming this quadratic model will ade¬ 
quately fit the data (lack of fit unimportant), this quadratic residual mean 
square may be used in Eq. (68) in place of the pure-error mean square. The 
lack-of-fit mean square in this equation would be the difference between the 
linear residual mean square [i.e., using Eqs. (32) and (33)] and the quadratic 
residual mean square. A model should be rejected only if the ratio is very 
much greater than the F statistic, however, since these two mean squares are 
no longer independent. 


2. Significance of Terms in the Regression 

Although the overall adequacy of a model to fit the data may be tested as 
described above, it is frequently desirable to test whether all of the terms in 
an adequate model need be included. This may be accomplished by a straight¬ 
forward extension of Section IV,A,1, involving a decomposition of the 
regression sum of squares into the desired components. Suppose, for example, 
that we wished to test the necessity of including the quadratic and interaction 
terms of Eq. (69). We do this by calculating the crude sum of 

squares and the regression sum ot squares using the fitted linear model 

r = b 0 + + b 2 x 2 + b 3 x 3 (70) 

This is, then, the regression sum of squares due to the first-order terms of 
Eq. (69). Then, we calculate the regression sum of squares using the complete 
second-order model of Eq. (69). The difference between these two sums of 
squares is the extra regression sum of squares due to the second-order terms. 
The residual sum of squares is calculated as before using the second-order 
model of Eq. (69); the lack-of-fit and pure-error sums of squares are thus 
the same as in Table IV. The ratio contained in Eq. (68) still tests the adequacy 
of Eq. (69). Since the ratio of lack-of-fit to pure-error mean squares in Table 
VII is smaller than the F statistic, there is no evidence of lack of fit; hence, 
the residual mean square can be considered to be an estimate of the experi¬ 
mental error variance. The ratio 


extra mean square due to quadratic terms 
residual mean square 


> Fa(P> N ~ P) 


(71) 
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TABLE VII 

Complete Analysis of Variance for Eq. (69) 


Source 

Sum of 
squares 

Degrees of 
freedom 

Mean 

squares 

Ratios 

F statistics 

95% 97.5% 

Crude 

640.49 

24 

_ 

_ 

_ 

_ 

Due to first 
order regression 

627.0 

4 

156.8 

_ 



Extra due to second 
order regression 

10.34 

6 

1.72 

_ 

_ 

_ 

Residual 

3.14 

14 

0.225 

7.66 

2.9 

3.5 

Lack of fit 

2.85 

11 

0.260 

— 

— 

— 

Pure error 

0.29 

3 

0.0954 

2.73 

8,8 

14.3 


then tests the necessity of the retaining the six quadratic terms in Eq. (69). 
Here, the ratio of quadratic to residual mean squares is so large that the 
quadratic terms must be used for an adequate fit of the rate data. This 
procedure may be carried out for any number of terms as long as we begin 
from the bottom of the table as was done here. 

There are many cases of interest to kineticists in which the necessity for 
the inclusion of terms in nonlinear rate models must be tested. This may be 
done approximately by again calculating the regression sum of squares with 
and without the added term. Kabel and Johanson (Kl), for example, con¬ 
sidered the model 

_ — P E P wlK) ,-y\ 

(1 + K a P a + K e P e + K W P W ) 2 

in which they suspected K E was zero. This could be tested by comparing the 
residual mean squares to the pure-error mean square, with and without the 
inclusion of the K E term. Table VIII summarizes the information thus 


TABLE VIII 

Approximate Analysis of Variance for Eq. (72) 


Situation 

kL 

K A ' atrrT 1 

Kw atm 1 

K e > atm" 1 

Residual mean 
square 

With K e 

7.02 x 10-* 

2.91 

6.34 

8.7 X 10' 3 

7.0 X 10-' 1 

Without K e 

7.04 x 10-* 

2.89 

6.30 

— 

6.8 X 10- 11 


obtained. From the closeness of the residual mean square, there is no reason 
to suspect that the K E term is needed. In this case, the parameter estimates 
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foresee this situation, since the estimated value of K E is near zero and the 
parameter estimates obtained with the K E term excluded are near those with 
the K e term included. If similar results were reported with the K A term 
omitted, the residual mean square without K A would be greatly inflated over 
that including the K A term. 

B. Residual Analysis 

The analysis of variance techniques of Section IV,A have been seen to 
provide information about the overall goodness of fit or about testing the 
importance of the contribution of certain terms in the model toward provid¬ 
ing this overall fit of the data. Although these procedures are quite useful, 
more subtle model inadequacies can exist, even though the overall goodness 
of fit is quite acceptable. These inadequacies can often be detected through 
an analysis of the residuals of the model. 

A residual is defined as the difference between the observed and predicted 
values of some response of interest, such as the reaction rate. For example, 
suppose that an experimentally observed reaction rate y is linearly dependent 
on two partial pressures x x and x 2 

y = P 0 + 01*1 + 02*2 + * (73) 

and that the predicted rate r is obtained using the correct model, 

r - b 0 + b l x 1 + b 2 x 2 (74) 

If the residuals are estimated using some unbiased method such as linear 
least squares, the V s would be expected to equal the /?’s, so that the residual 
becomes 

y “ r = e (75) 

Since the difference between the predicted rate and the observed rate is 
attributable solely to experimental error, plots of this residual versus any 
independent variable should exhibit all of the characteristics of this error, 
such as being random with zero mean. If, on the other hand, the model of 
Eq. (74) is wrong in that the b 2 x 2 term is omitted, the residual should not be 
random when plotted versus either r or x 2 . Numerous suggestions have been 
made concerning the types of residual plots that are most revealing (Al, 
A2, B8, D4). These include plotting the residuals in overall plots, against 
predicted values r, against independent variables x i9 and against time. 

1. Overall Plots 

The overall plots consist of plotting the frequency of occurrence of the 
rounded values of the residual against the magnitude of the residual. These 
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plots allow an approximate assessment of the normality of the error distribu¬ 
tion provided by y — r if the model is correct. The plots are also a check of 
whether the mean of this error distribution is zero, as should be fufilled for 
linear and nonlinear least-squares applications. Hence, these plots allow an 
approximate check on some of the least-squares assumptions after the least- 
squares analysis has been completed. It should be noted that the mean of the 
residuals must be zero if the linear-least-squares calculations have been 
correctly carried out for linear models containing a constant [e.g., b 0 in 
Eq. (69)], so only a check on the algebra of linear least squares is obtained here. 

2. Predicted Value Plots 

A plot of the residual versus the predicted value, r, of a model can indicate 
whether the model truly represents the rate data. For example, residuals 
that are generally positive at low predicted rates and negative at high pre¬ 
dicted rates can indicate a model inadequacy, even though the overall test 
of an analysis of variance indicates that the model is acceptable. 

The analysis of variance for the model of Eq. (32), for example, for the 
data on the isomerization of normal pentane was shown in Table V; we 
concluded that the model was marginally acceptable. However, the plot of 
the residuals of Fig. 15 indicates that this overall fit is achieved by balancing 
predictions that are too low against predictions that are too high. Hence the 
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model should be rejected. 

These plots can also provide information about the assumption of con¬ 
stant error variance (Section III) made in the unweighted linear or nonlinear 
least-squares analyses. If the residuals continually increase or continually 
decrease in such plots, a nonconstant error variance would be evident. Here, 
either a weighted least-squares analysis should be conducted (Section III,A,2) 
or a transformation should be found to stabilize the error variance 
(Section VI). 

Figure 16 presents residuals of 72 data points at three temperatures for 
the hydrogenation of propylene, reported by Shabaker (SI); the model was 
fitted by us to these data using unweighted nonlinear least squares with 
Eq. (8). The nonconstant error variance shown here indicates that a weighted 
nonlinear least-squares analysis would have been more appropriate. The 
residuals trends were largely removed by Shabaker by fitting the logarithm 
of the reaction rate rather than the rate itself (a transformation of coordinates 
such as discussed in Section VI). 

3. Independent Variable Plots 

The plot of the residuals versus the independent variables can yield 
information as to which of the variables in the equation is causing the residual 



Fig. 16. Plot of residual rates versus predicted rate for Eq. (8). 
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trends that occur in the predicted value plots. The nonconstant error variance 
described in the above section also is exhibited in these plots, and can provide 
information useful in developing a weighting function. 

Of particular value in kinetic studies are residual plots using the linearized 
form of the Hougen-Watson equation. For the model of Eq. (18), for example, 
we obtain 


Pno Ph 2 
r 


kK NO k 


+ 


Kh 2 

kK no 


Pn 2 


(76) 


Reasoning analogous to that of Eqs. (73)-(75) indicates that if this model is 
wrong only in that adsorbed water should have been included, a plot of 
residuals versus water partial pressure should be linear. Or, if this model is 
erroneous only in that the hydrogen should have been considered to be 
dissociated, the residuals should be independent of the nitric oxide partial 
pressure and nonlinearly dependent upon the hydrogen partial pressure. 
It should be noted that these observations are strictly valid only if the data 
have been taken according to an orthogonal (e.g., factorial) design. This 
subject is treated more comprehensively in Section V; examples of these 
effects are included there. 


4. Time-Residual Plot 

The plot of residuals versus some measure of the time at which experiments 
were run can also be informative. If the number of hours on stream or the 
cumulative volume of feed passed through the reactor is used, nonrandom 
residuals could indicate improper treatment of catalyst-activity decay. In 
the same fashion that residuals can indicate variables not taken into account 
in predicting reaction rates, variables not taken into account as affecting 
activity decay can thus be ascertained. 

5. Parametric Residuals 

The residuals discussed thus far have been associated with some dependent 
variable, such as the reaction rate r. It is particularly advantageous in pin¬ 
pointing the type of defect present in an inadequate model to expand this 
definition to include parametric residuals. The parametric residual, then, 
is simply the difference between a value of a given parameter estimated from 
the data and that predicted from a model. For example, the dots in Fig. 17 
represent the logarithm of the alcohol adsorption constants measured in 
alcohol dehydrogenation experiments from isothermal data at each of several 
temperature levels (FI). The solid line represents the expectation that these 
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points can be linearly dependent upon reciprocal temperature. If they do not, 
a defect exists in the model. 

The plot of the residuals of this adsorption constant, using the data of 
Fig. 17, is shown in Fig. 18. A substantial trending effect is evident (it is also 
evident from Fig. 17). This is of interest in itself, but it is especially desirable 
to utilize the information of Fig. 18 to learn how the model must be modified 
to remove the observed defect. The other residual plots of the section can also 
assist this objective. This topic, called model-building, is treated in Section V. 



Reciprocal Temperature x I0 3 , °K 1 

Fig. 17. Dependence of ethanol adsorption constant on temperature—alcohol dehydro¬ 
genation. 



Fig. 18. Dependence of residuals of ethanol adsorption constant on temperature— 
alcohol dehydrogenation. 
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V. Use of Diagnostic Parameters 

Recently certain diagnostic parameters have been exploited to allow a 
discrimination among several rival models. These diagnostic parameters can 
be grouped into two broad classes—those that are inherently present in the 
model, and those that are introduced solely for the purpose of model 
discrimination. 

There are two primary advantages to the use of diagnostic parameters 
in reaction-rate modeling. First, the use of these parameters allows an easy 
analysis of the adequacy of the model. This is accomplished by choosing the 
diagnostic parameters such that a linear test of a nonlinear model may be 
obtained. Second, in some cases, the diagnostic analysis will not only indicate 
a model inadequacy but also can suggest the precise nature of the inadequacy. 
Then, the model can be appropriately changed to remove this inadequacy. 

A. Model Discrimination with Diagnostic Parameters 

1. Nonintrinsic Parameters 

A nonintrinsic parameter is one that is introduced into a model for the 
purpose of allowing the specification of a preferred model from a larger 
group of rival models. To be useful, such a parameter must simplify the ana¬ 
lysis procedures, allow a broadening of the data base of the analysis, or both. 
One type of nonintrinsic parameter useful in this regard enters as a multiplier 
of each of a series of predicted responses (C3, C4). For two models, such a 
method reduces (W4, W5) to defining a dependent variable 

z = y- 1/2 (/-! + r 2 ) (77) 

This variable is then to be fitted using the equation 

t = A(r 2 -r 1 ) (78) 

where A is the nonintrinsic parameter used in specifying which of models r x 
and r 2 is to be preferred. It is to be noted that if 

(79) 

then A should be -1/2, while if r 2 were adequate A should be +1/2. 

The procedure to be followed, then, is to estimate the parameters K within 
each reaction-rate model by some appropriate technique (K8). The intrinsic 
parameter A can then be estimated by linear least squares. Owing to experi¬ 
mental error in the data, this estimate of A will typically be neither plus nor 
minus one-half. Hence the remaining portion of the analysis is to estimate the 
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confidence interval for this discriminatory parameter estimate. If this con¬ 
fidence interval contains both possible values, this would suggest that no 
decision could be made about the most adequate model; if neither value is 
contained, both models should be viewed with suspicion. 

In the above analysis, y was considered to be a reaction rate. Clearly, 
any dependent variable can be used. Note, however, that if the dependent 
variable, y , is distributed with constant error variance, then the function z 
will also have constant error variance and the unweighted linear least-squares 
analysis is rigorous. If, in addition, y has error that is normal and indepen¬ 
dent, the least-squares analysis would provide a maximum likelihood estimate 
of A. On the other hand, if any transformation of the reaction rate is felt to 
fulfill more nearly these characteristics, the transformation may be made on 
y, r u r 2 and the same analysis may be applied. One common transformation 
will be logarithmic. 

This method has been applied (M5) for modeling the vapor-phase rate of 
dehydration of secondary butyl alcohol to the olefin over a commercial 
silica-alumina cracking catalyst. Integral reactor data are available at 400, 
450, and 500°F. Two models considered for describing this reaction are the 
single site 


MCaPa 

1 + K A p A + K w p w 


(80) 


and the dual site 


KKaPa 

(1 + K A p A + K wo p w ) 2 


(81) 


The usual plots of the initial reaction rate for 500°F are shown in Fig. 19. In 



Fig. 19. Linearized initial rate plots for sec-butanol dehydration, Eqs. (80) and (81), 
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spite of this apparently good fit of the initial rate data by both models, the 
analysis of the intrinsic parameter A will indicate that the single-site model 
cannot accurately represent the higher conversion data. In this case, the 
parameter A is defined. 

x - 1/2C*! + x 2 ) = A(x 2 - x x ) (82) 

where x is the experimentally measured conversion, and x x and x 2 are the 
conversions predicted by the integrated forms of Eqs. (80) and (81). The 
parameters within these latter equations were estimated by nonlinear least 
squares (M5). 

The values of A thus obtained are reported in Table IX. It is clear that the 


TABLE IX 

Estimates of Discriminatory 
Parameter, A 


Temperature °F 

Parameter estimate, A fl 

400 

0.36 ± 0.28 

450 

0.47 ±0.23 

500 

0.46 ±0.14 


“The numbers following the ± signs define 
the 95 % confidence interval on A, based upon 
residual mean squares. 


preferred value of A is plus one-half, and hence that the data do not reject 
the dual model r 2 . However the probability that a A of minus one-half could 
represent these data is less than 0.0001 %. Thus, the single-site model should 
be rejected. The parameter estimates for the dual-site model are reported 
elsewhere (M5); they approximately exhibit an exponential temperature 
dependence, with slopes compatible with the usual predictions of kinetic 
theory. 

2. Intrinsic Parameters 

An intrinsic parameter is one that is inherently present in or arises naturally 
from a reaction-rate model. These parameters, which are of a simpler func¬ 
tional form than the entire rate model, facilitate the experimenter’s ability 
to test the adequacy of a proposed model. Using these intrinsic parameters, 
this section presents a method of preparing linear plots for high conversion 
data, which is entirely analogous to the method of the initial-rate plots 
discussed in Section II. Hence, these plots provide a visual indication of the 
ability of a model to fit the high conversion data and thus allow a more 
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complete test of a model than does the initial-rate analysis alone. When used 
in conjunction with the linear initial-rate correlations, the high conversion 
plots provide estimates of all the parameters in models that exhibit only one 
adsorbed reactant and one adsorbed product. 

When any hyperbolic model is written in terms of fractional conversions 
instead of partial pressures, two groupings of terms inherently arise within 
the denominator These two groupings will be called the intrinsic parameters 
C x and C 2 . For example, when data are taken for the olefinic dehydration 
of a pure alcohol feed to a reactor, Eq. (80) becomes 

= _ kK A t(l - x)/(l + x)]tt _ 

1 + K^[(l - x)/(l + x)]7I + K w [x!{\ + x)]7I 

In this formulation, the reaction is assumed to be essentially irreversible; 
the existence of product partial pressures in the numerator does not alter the 
method of analysis to be discussed here, however. Equation (83) may be 
written 


(1 — x)n 

' ~{C t +C 2 x) 

(84) 

- 1 71 

1 kK A k 

(85) 

1 (K W -K A )^ 

kK A kK A 

(86) 


The C x term, then, is the collection of terms not multiplied by conversion 
while C 2 is the collection multiplied by the conversion, x. 

The use of the parameter C x in Eq. (84) was the subject of a portion of 
Section II, for Eq. (84) can be written at zero conversion in terms of the initial 
reaction rate: 


Cx 



(87) 


and Eq. (85) suggests that this variable should be linear with tt. From Eq. 
(84), values of C 2 can be estimated by 


C 2 


(1 — x)n n 
xr xr 0 


( 88 ) 


where C x has been written as shown in Eq. (87). Here, the several values of 
C 2 (corresponding to several conversion points) at each pressure level can be 
averaged to give a single value of C 2 at each pressure. Equation (86) requires 
that these estimates be linear with pressure. 
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If this linear analysis is to be used, the experimental conversion-space¬ 
time data should first be taken at several pressure levels. Using the C i analysis 
alone, then, the plots of C x versus total pressure should be made for a pre¬ 
liminary indication of model adequacy. If several models are found to provide 
near-linear C x plots, the complete linear analysis using the C 2 plots should 
assist in the discrimination among the remaining rival models. If a model is 
adequate, both the C x and C 2 points should be correctable by a straight line 
with a common intercept, as demanded by Eqs. (85) and (86). If only one 
model is found to be adequate following the initial C x analysis, the complete 
C x and C 2 analysis should still be carried out on this model to verify its 
ability to fit the high conversion data. 

Consider again the 500°F data for the alcohol dehydration just discussed. 
Equations (84)—(88) arose from Eq. (80). A similar set of equations can be 
generated from Eq. (81) (K5). Let us examine the two models through an 
analysis of their intrinsic parameters. 

The Cj (initial reaction rate) plots were presented in Fig. 19. Note again 
that the data may be correlated well by straight lines for both models. The 
C 2 values are correlated by the solid lines of Fig 20. Note that the dual-site 
values can again be correlated by a straight line, but that the single-site 
values of C 2 show a definite curvature. Alternatively, the 0.975 atm value of 
the single-site C 2 could be rejected, and the three high-pressure points 



Total Pressure, Atm 

Fig. 20. Plot of parameter C 2 versus pressure for sec-butanol dehydration, Eqs. (80) 
and (81). 



MATHEMATICAL MODELING OF CHEMICAL REACTIONS 


147 


correlated by a straight line. Since the latter procedure results in a negative 
intercept for C 2 [cf. Eq. (85)], either approach eliminates the single-site 
model from further consideration. Although the intercept from the dual-site 
C 2 plot is somewhat higher than that of the dual-site C r plot, the 97.5% 
confidence intervals for these two intercepts overlap. 

By using only simple hand calculations, the single-site model has been 
rejected and the dual-site model has been shown to represent adequately 
both the initial-rate and the high-conversion data. No replicate runs were 
available to allow a lack-of-fit test. In fact this entire analysis has been 
conducted using only 18 conversion-space-time points. Additional discussion 
of the method and parameter estimates for the proposed dual-site model are 
presented elsewhere (K5). Note that we have obtained the same result as 
available through the use of nonintrinsic parameters. 

B. Adaptive Model-Building with Diagnostic Parameters 

The development of an adequate mathematical model representing a 
physical or chemical system is the object of a considerable effort in research 
and development activities. A technique has been formalized by Box and 
Hunter (B14) whereby the functional form of reaction-rate models may be 
exploited to lead the experimenter to an adequate representation of a given 
set of kinetic data. The procedure utilizes an analysis of the residuals of a 
diagnostic parameter to lead to an adequate model with a minimum number 
of parameters. The procedure is used in the building of a model representing 
the data rather than the postulation of a large number of possible models 
and the subsequent selection of one of these, as has been considered earlier. 
That is, the residual analysis of intrinsic parameters, such as C x and C 2 , 
will not only indicate the inadequacy of a proposed model (if it exists) but 
also will indicate how the model might be modified to yield a more satisfactory 
theoretical model. 

1. Hyperbolic Models 

In the following discussion, we shall again separate the terms of a hyper¬ 
bolic model and identify two parameters and C 2 . As before, each of these 
two parameters will be a collection of terms, one of which is multiplied by 
conversion and one not multiplied by conversion. In previous formulations, 
however, we have oriented the discussion toward a familiar type of experi¬ 
mental design in kinetics: conversion versus space-time data at several 
pressure levels. Consequently, the parameters C x and C 2 were defined to 
exploit this data feature. Another, type of design that is becoming more 
common is a factorial design in the feed-component partial pressures. 
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Consequently the definitions of the parameters C x and C 2 to be used here 
will reflect this type of design. 

Let us first consider the type of analysis to be used. Suppose that the model, 
such as Eq. (83), were to predict that C x should be related to the feed-com¬ 
ponent partial pressures as 

£1 = b 0 + b 2 p 2 + b 4 p A (89) 

when in fact the true values of C x from the data were related as 

C x = Po + P 2 P 2 + P 3 P 3 + P 4 p 4 + £ (90) 

Then, for an orthogonal design and zero expected value of error, the expected 
value of the residual of C t is (H5) 

£(Ci - Q) = fi 3 p 3 (91) 

Hence, if a correlation of the residual with any independent variable can be 
detected (e.g., with p 3 ), then the model generating Eq. (89) is inadequate. 
Furthermore a modification of the manner in which p 3 enters the model is 
necessary. 

The analysis could easily become more complicated. If, for example, 

€1 = K + b t p 3 + b 3 p 3 (92) 

then 

E(C X - C x ) = —piPi + P2P2 + (93) 

In this case, the residual will be correlated with the weighted sum of the 
three partial pressures. Although a trend may still be detectable, we will not 
generally be able to determine with which of the variables the residual is 
correlated and thus will not know how to correct the model; a careful initial 
model choice circumvents this problem. 

Application to Methane Oxidation. This selection of an appropriate initial 
model can be accomplished as shown here for the complete oxidation of 
methane. A general representation of the surface reaction model is (K12) 

_ _ fcPiU - x )(Pi - 2j>i*) 2 _ 

r ~ [1 + K 3 p i(l - x) + K 2 (p 2 - 2p t x) + K 3 (p 3 + p 3 x) + X 4 (p 4 + 2p 3 x)J 

(94) 

where p u p 2 , p 3 , and p 4 are the feed partial pressures of methane, oxygen, 
carbon dioxide, and water, respectively, and * is conversion. Collecting 
terms and using the continuity equation for an integral reactor, 

dx _ p 3 (l - x)(p 2 - 2p { x) 2 
(C,+C 2 x)» 


8W/F 


( 95 ) 
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For this generalized model, then, 



(96) 

(97) 


C 1 = b 0 + b 1 p l + b 2 p 2 + b 3 p 3 + b 4 p 4 
C 2 = bp l 


(98) 

(99) 


The problem of specifying an adequate model will now be to determine (1) 
the exponent n by a C 2 analysis and (2) the denominator terms required by 
a C i analysis. Depending upon the particular surface-reaction model con¬ 
sidered, the terms within Eq. (98) can change greatly. For any surface- 
reaction model, however, C 2 remains the same. Equating Eqs. (97) and (99) 
provides an equation with two unknown parameters, h and n. Estimating 
n in this way from the reaction data will specify the power of the denominator 
of Eq. (94). Generally, the selection of any of the models with the appropriate 
n will eliminate the difficulties represented by Eq. (93) and hence allow an 
effective C x analysis. 

For the complete vapor-phase oxidation of methane over a palladium 
alumina catalyst, conversion-space-time data were taken at 350°C and 1 atm 
total pressure; the fractional factorial design of Table X (HI 1) specified the 
settings of the feed partial pressures of the reacting species. 

From these data, n has previously been estimated to be 3 (K12). By start¬ 
ing with models of n = 3, the possibility of an ineffective C x analysis is 
diminished. 

With n = 3, one model with a minimum number of parameters is model 
1 of Table XI. By choosing a model with the smaller number of parameters, 
of course, we tend to prevent including parameters in the models unless their 
presence is absolutely necessary (principle of parsimonious parameterization). 
The residuals of the diagnostic parameter, C u for this model are shown in 
Table XII, allowing an analysis such as that suggested earlier. Note that the 
perfect correlation of the signs of residuals with the water partial pressure 
of Table X suggests that model 1 should be modified by changing the manner 
in which the effect of water is taken into account. To maintain n = 3, this 
requires the inclusion of an additional parameter, resulting in model 2 of 
Table XI. The residuals of model 2 of Table XII, being perfectly correlated 
with the carbon dioxide level of Table X, suggest that the effect of carbon 
dioxide is improperly described by model 2. 

Keeping the number of parameters to a minimum, we are thus led to 
model 3 of Table XI. However, the residual trends of model 3 as shown in 
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TABLE X 


Experimental Design 


Run No. 

Pi 

P2 

Pi 

P4 

1 

“1 

-1 

-1 

+ 1 

2 

+ 1 

-1 

-1 

-1 

3 


4-1 

-1 

-1 

4 

+ 1 

+ 1 

-1 

41 

5 

-1 

-1 

+ 1 

-1 

6 

+ 1 

-1 

+ 1 

+ 1 

7 

“1 

+ 1 

+ 1 

+ 1 

8 

+ 1 

+ 1 

+ 1 

-1 

Pi 

-0.015 


P2-o.no 


Pi= ~ 

0.005 

P2 = 

0.060 


- Pi 

- 0.065 


Pa - 0.095 


/>3 

0.035 

^4 = 

0.055 



TABLE XI 


Matrix of Terms Comprising 
Postulated Models 0 


Model 

No. 

Methane 

Pi 

Oxygen 

Pi 

Carbon 

dioxide 

Pi 

Water, 

P* 

1 

1 

1 

0 

0 

2 

1 

1 

0 

1 

3 

1 

1 

1 

0 

4 

1 

0 

1 

1 

5 

0 

1 

1 

1 


fl 0 and L Gaseous and adsorbed state, 
respectively. 


Table XII would indicate that the effect of water is not described adequately 
by the model. Utilizing the principle of parsimonious parameterization, one 
can consider both water and carbon dioxide to be adsorbed and oxygen to be 
nonadsorbed, resulting in the three-parameter model 4. The residuals in 
Table XII for model 4, however, are correlated with the oxygen level. Hence 
model 5 would perhaps be preferable, for it likewise contains only three 
parameters while allowing adsorbed oxygen. The random residuals of 
Table XII for model 5 indicate that this model cannot be rejected using the 
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TABLE XII 

Residuals of Parameter C x for the Postulated Models x 10 3 


Run 

No. 

1 

Residuals for model 

2 3 4 

5 

1 

+ 1.17 

-1.38 

+2.49 

-7.18 

-0.29 

2 

-3.61 

-1.07 

-2.51 

-7.70 

+0.50 

3 

-4.99 

-2,21 

-3.75 

+7.54 

-1.08 

4 

+2.33 

-0.54 

+ 3.43 

+7.88 

+0.93 

5 

-1.18 

+ 1.81 

-2.37 

-7.00 

+0.24 

6 

+ 3.39 

+ 0.55 

+2.21 

-8.75 

-0.51 

7 

+4.01 

+ 1.42 

+ 2.97 

+ 8.04 

-0.08 

8 

-1.13 

+ 1.42 

-2.46 

+7.16 

+ 0.29 


data presented. Thus, a reaction-rate model adequately describing these 
experimental data is model 5: 

r = _ kikiPiO- ~ x)(Pi - 2/ii-x) 2 _ 

{1 + K 2 (p 2 - 2p L x) + K 3 (p 3 + p^x ) + K 4 (p 4 + 2p 1 x)} 3 

This model has previously been shown (Hll, K12) to have a residual mean 
square comparing favorably with that expected from pure error, as discussed 
in Section IV. It is to be noted that we have been led logically from one model 
to another within the small class of models for which n — 3 by the above 
analysis. For these data, adsorbed methane is not required; however, for 
data with higher methane concentrations, the adsorbed-methane term may 
be needed. 

The foregoing development represents a substantial deviation from, and 
in some cases a distinct improvement on, the more usual methods of obtaining 
a reaction model adequately describing a set of data. In particular, for the 
hyperbolic models, we have seen that it may be necessary to write a large 
number of possible models and to search extensively through these to find 
an adequate model. Common criteria for adequacy include the requirements 
that the rate data plot linearly with the proper choice of the ordinate, and that 
the parameter estimates be acceptable. For methane oxidation, more than 
eighty such models could be thus considered! By contrast, we started with the 
class of surface reaction controlling models, since experience has indicated 
that these models are generally found to represent reaction data. The analysis, 
then, provides first for an estimation of the number of sites participating in 
the reaction, w, without the necessity of the simultaneous estimation of the 
magnitudes of all of the individual adsorption constants in the model. Then, a 
model with the appropriate value of n and with the minimum possible 
number of adsorption constants is considered, and the experimenter is led by 
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the experimental data and the analysis to a model that, with a minimum 
number of parameter, adequately represents the data. 

2. Power-Function Models 

A similar procedure may also be used with the power-function models. 
For example, if we are considering the reaction (B14) 

A + B-+F 
A + F^G 

then an initial postulation of the form of the reaction rate model might be 


— dCJdt — k±C b “I - k 2 C F (101) 

— dC B /dt = fcjCfl (102) 

dCJdt = — k 2 C F (103) 

dCJdt = k 2 C F (104) 


These equations might be thought to apply for a high concentration of 
reactant A y at a given concentration of catalyst C, and at isothermal condi¬ 
tions. To describe measurements on the concentration of F as a function of 
time at each of the initial conditions of the 2 4 factorial design of Table 
XIII, we might tentatively entertain the integrated model: 

C F = (e- k2t - e~ k ") (105) 

k x — k 2 

This is simply the integrated form of the above equations. 

Since several isothermal data points exist as a function of time for each 
of the 16 run conditions of Table XIII, we can estimate the values of the 
parameters k x and k 2 for each of these 16 conditions. This may be done using 
Eq. (105) and nonlinear least squares; the parameter estimates are shown in 
Table XIII (as taken from Reference B14). 

Now, if the model of Eqs. (101)—(104) is oversimplified, the estimated 
values of the parameters k x and k 2 will likely be representable as 

k = C m AO C n co k 0 e- E ‘ RT (106) 

or 

In k 1 = m 1 In C AO + In C co + In k 0l — EJRT (107) 

In k 2 — ni 2 In C A q H - n 2 In Cco 4* In ko 2 — E 2 jRT (108) 

It should be noted at this point that we are confronted with a highly nonlinear 
problem involving extremely complex equations [if Eqs. (107) and (108) were 
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TABLE XIII 

Experimental Design and Estimated Parameter Values 


Run 







No. 

Cao° 

Cbo 

a r> a 

i L co 

D a 

-10 In k\ 

— 10 In k 2 

1 





79.74 

72.68 

2 

+ 

— 

— 

— 

61.99 

62.09 

3 

— 

+ 

— 

— 

77.49 

68.03 

4 

+ 

+ 

— 

- 

64.56 

62.04 

5 

— 

- 

+ 

- 

74.80 

67.25 

6 

+ 

— 

+ 

— 

59.91 

62.19 

7 

— 

+ 

+ 

- 

75.99 

69.35 

8 

+ 

+ 

+ 

— 

61.01 

62.40 

9 

- 

- 

— 

+ 

69.48 

65.09 

10 

+ 

— 

— 

+ 

55.70 

58.05 

11 

- 

+ 

- 

+ 

68.78 

64.31 

12 

+ 

+ 

— 

+ 

54.82 

58.19 

13 

- 

- 

+ 

+ 

65.71 

63.42 

14 

+ 

- 

+ 

+ 

52.04 

57.86 

15 

— 

+ 

+ 

+ 

65.22 

64.44 

16 

+ 

+ 

+ 

+ 

51.76 

57.01 

fl Values for pluses 

i and minuses for C AO , 

Cbo > Cco> and 

D , respectively, 

are given 

below: 







Cao 


Cbo 


C C o 

D, temperature 

+ 40: 

moles/liter 


2 moles/liter 


1 mM liter 

175°C 

- 20 

moles/liter 


1 mole/liter 


0.5 mM liter 

165°C 


inserted into Eqs. (101)—(104) ]. Rather, we have broken the problem into one 
easily solved equation [Eq. (105)] and two linear equations [Eqs. (107) and 
(108)]. In fact, the problem is entirely analogous to that of the previous 
subsection, except that the observed values of the parameters are contained 
in Table XIII and the predicted values are written as Eqs. (107) and (108). 
If we wished to adopt a solution similar to that used in model-building with 
hyperbolic models, we would assume the simplest form of Eqs. (107) and 
(108). Then we would predict parameter values k x and k 2 at each run, and 
compare the parameter residuals to the design of Table XIII. If the residuals 
were correlated with certain of the variables, we could introduce these 
variables, and proceed. 

To illustrate a slightly different approach, however, let us simply fit 
Eqs. (107) and (108) to the observed values of the parameters k x and k 2 
of Table XIII. Thus, we are able to estimate the magnitudes of the desired 
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parameters of Eqs. (107) and (108). The results of such a linear least-squares 
fit are contained in Table XIV. This table suggests that Eq. (105) should be 

TABLE XIV 

Parameter Estimate Summary 


Parameter 

rtf 

tf 

la k 0 a E, 

, kcal/gm-moIe“ 

kx 

2.08 ±0.16 

0.47 ±0.16 

26.59 ± 4.70 

35.2 ±4.2 

kz 

0.99 ± 0.20 

0.12 ±0.20 

11.26 ±0.65 

18.5 ±5.4 


“The numbers following ± signs represent 95% confidence intervals, 
modified to read 


where 


Cr — 


>C 2 


BO '■'MO L CO 


C 0 5 Is' 


C 2 A oC 0 co 5 k[-C AO k 


7 ( e ~ k 2CAot _ e ~ klAO Cc*) 


(109) 


k[ = 3.518 x io 11 e” 35200/ * r 
k ' 2 = 7.795 x 10V 185OO/Rr 


That is, the data suggest that Eqs. (101)—(104) should be modified to allow 
formation of F by a reaction second order in A , first order in B , and half-order 
in the catalyst concentration C. The disappearance of F appears to be first 
order in A and in B. 

Before the analysis is complete, we must examine the adequacy of the 
model to fit the data. Since this procedure is described in Section IV as well 
as in Reference (B14), this aspect of the fitting will not be pursued here. We 
have simply demonstrated here how the experimenter can be led from an 
inadequate model to an adequate model by the data analysis. 


VI. Empirical Modeling Techniques 

In every case in which a kinetic model is selected to represent adequately 
a reaction, the rate surface predicted by the model must be compared to the 
surface observed in the data. In the methods discussed in Section II, only one 
section through the entire rate surface was examined; for example, the 
dependence of initial rate on total pressure could be investigated when in 
fact the total rate surface constituted the dependence of rate on several 
component partial pressures and temperature. The misleading results obtain- 
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able by using sections of surfaces in this manner have been discussed 
(K3, K7). 

In this section, methods are described for obtaining a quantitative 
mathematical representation of the entire reaction-rate surface. In many 
cases these models will be entirely empirical, bearing no direct relationship to 
the underlying physical phenomena generating the data. An excellent empiric¬ 
al representation of the data will be obtained, however, since the data are 
statistically sound. In other cases, these empirical models will describe the 
characteristic shape of the kinetic surface and thus will provide suggestions 
about the nature of the reaction mechanism. For example, the empirical 
model may require a given reaction order or a maximum in the rate surface, 
each of which can eliminate broad classes of reaction mechanisms. 

A. Response-Surface Methodology 

The two primary features of response surface methodology are the 
experimental design and the method of data analysis (B7, D2, H3). In the 
design stages of the study, it is advantageous to use a minimum-variance 
design, such as the factorial or certain composite designs (H3). These designs 
not only cover a wide area of the kinetic surface, but they also can provide 
minimum-variance estimates of the parameters. 

The importance of the parameter estimates becomes apparent from the 
data analysis. Suppose a nonlinear reaction-rate equation contains two 
independent variables and a set of unknown parameters: 

r =f(x l , x 2 ; K) (110) 

The rate surface can be approximately represented, in the region where the 
data are taken, by a Taylor expansion 

r = 6 0 + b x x x + b 2 x 2 + 612 * 1*2 + b lx x 2 + b 22 x 2 ( 111 ) 

Here, the coefficients b represent first and second partial derivatives of the 
rate expression f(x u x 2 \ K) or functions thereof. 

This is the general equation for an ellipse in two independent variables, 
as shown in Fig. 21. If a new coordinate system is defined that has its center 
at the point S and axes directed along X x and X 2 of Fig. 21, then Eq. (Ill) 
reduces to 

r-r s = B lx X 2 + B 12 X 2 (112) 

Examination of the coefficients B , then, characterizes the rate surface. If, for 
example, B n and B 22 were significantly positive, any movement along the 
axes X x would increase the rate r. Hence, the surface has a minimum at point 
S . Similarly, if the coefficients were negative, a maximum exists at point S. 
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Fig. 21. Contours of constant reaction rate, Eqs. (Ill) and (112). 

A saddle point exists if the coefficients are of unequal sign (B7, D2). Precise 
estimates of the parameters are of considerable importance. 

The mathematical details of carrying out such a redefinition of coordinate 
system, termed a canonical transformation, have been presented elsewhere 
in detail (D2). 

1. Response-Surface Methods in Kinetic Modeling 

An augmented central composite design was used in obtaining reaction- 
rate data in a flow differential reactor; the reaction occurring was the iso¬ 
merization of normal pentane to isopentane in the presence of hydrogen 
(Cl). Using the subscripts 1, 2, and 3 for hydrogen, normal pentane, and 
isopentane respectively, an empirical rate equation can be written 

r = b 0 + b x x x +b 2 x 2 + b 3 x 3 + b u x x 2 + b 22 x 2 2 + b 33 x 3 2 

+ bi 2 x 3 x 2 + b i3 XjXj + b 23 x 2 x 3 (113) 

The parameter estimates obtained by a linear least-squares analysis of the 
original coded data (Cl) are shown in Table XV. After a canonical analysis, 
this equation becomes 

r - 2.11 = 0.155*! 2 - 0.292X 2 2 + 0.4192T 3 2 (114) 

The standard errors of the coefficients are, respectively, 0.094, 0.108, and 
0.095. The center of the coordinate system is at x t = 1.32, x 2 = 1.43, and 
x 3 = 2.08. 
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TABLE XV 

Parameter Estimates for Eq. (113) 


Parameter 

Estimate (with standard error) 

b 0 

3.06 ± 0.20 

b i 

-0.490 ±0.12 

b. 

1.26 ±0.12 

b i 

—1.50 ± 0.12 

in 

-0.077 ±0.12 

b 2 2 

-0.208 ±0.07 

bi* 

0.257 ±0.12 

b\i 

-0.241 ±0.12 

b\z 

0.505 ±0.14 

^23 

-0.158 ±0.14 


From a consideration of either Eqs. (113) or (114) (K3), it is evident 
that a saddle point is predicted from the fitted rate equation. This could 
eliminate from consideration any kinetic models not capable of exhibiting 
such a saddle point, such as the generalized power function model of Eq. (1) 
and the several Hougen-Watson models so denoted in Table XVI. 

However, this particular experimental design only covered values of x 3 
up to 1.68; consequently, the saddle point is only predicted by the model 
and not exhibited by the data. This is the reason the lack-of-fit tests of Section 
IV indicated neither model 3 nor model 4 of Table XVI could be rejected 
as inadequately representing the data. As is apparent, additional data must 
be taken in the vicinity of the stationary point to confirm this predicted 
nature of the surface and hence to allow rejection of certain models. This 
region of experimentation (or beyond) is also required by the parameter 
estimation and model discrimination designs of Section VII. 

Response-surface methodology has also been used to gain theoretical 
insight into a reacting system (B19) and to determine the order of a reaction 

(P3). 

2. Re spouse-Surface Methods in Process Exploitation 

Response-surface methodology has been used extensively for determining 
areas of process operation providing maximum profit. For example, the 
succinct representation of the rate surface of Eq. (114) indicates that increasing 
values of X 3 will increase the rate r. If some response other than reaction 
rate is considered to be more indicative of process performance (such as cost, 
yield, or selectivity), the canonical analysis would be performed on this 
response to indicate areas of improved process performance. This information 



TABLE XVI 


Model 

1. Adsorption controlled 
single site 


2. Adsorption controlled 
with dissociation 


3. Single site, surface 
reaction controlled 


4. Dual site, surface 
reaction controlled 


5. Dual site, surface reaction 
controlled, hydrogen 
dissociated 

6. Surface reaction controlled, 
dissociated w-pentane 

7. Desorption controlled 


Possible Rate Models for 


k(x 2 — x 3 /K) _ 

[1 + K x x x + (K 2 /K + K 3 )x 3 ] 


_ k(x 2 — x 3 jK) _ 

[1 + K lXl + (K 2 }Kx 3 )* + K 3 x 3 ] 2 

kK 2 (x 2 - x 3 /K) 

(1 + KiXi + K 2 x 2 + K 3 X 3 ) 

kK 2 (x 2 — x 3 /K) 

(\ + K iX i + K 2 x 2 + K 3 x 3 Y 

kK 2 (x 2 - x 3 /K) 

[1 + (^Jt ,) 1 ' 2 + K z x 2 + K 3 x 3 ] 2 

kK 2 (x 2 -x 3 /K) 

[1 + KiXi(K 2 x 2 ) i/2 + K 3 x 3 ] 2 
kK(x 2 - x 3 /K) 

[\+K iXi + (K 2 + KK 3 )x 2 ] 


L/i 

OO 


Isomerization 

Rate surface for model 

No stationary point except at equilibrium; first deriva¬ 
tive with Xi zero only at equilibrium; first derivative 
with x 2 always positive and with x 3 always negative 
except at equilibrium. 

No stationary point except at equilibrium; first deriva¬ 
tive with zero only at equilibrium; first derivative 
with x 2 always positive; first derivative with x 3 may be 
zero with second derivative positive. 

No stationary point except at equilibrium; first deriva¬ 
tive with Xi zero only at equilibrium; first derivative 
with x 2 always positive and with x 3 always negative 
except at equilibrium. 

Can have saddle point; first derivative with x t zero at 
equilibrium; first derivative with x 2 may be zero with 
second derivative negative; first derivative with x 3 may 
be zero with second derivative positive. 

Same as model 4. 


Same as model 4. 


Same as model 1. 


KITTRELL 
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is obtained in addition to the modeling information described above. Hence, 
the response surface approach can yield dividends in both areas. The use 
of response surface methods for general process improvement (B7, D2) and 
for investigating the influence of process variables and catalysts on reactions 
(F2, F3) has been discussed. 

B. Transformations of Variables 

Particularly in empirical modeling, the transformation of a model can 
greatly assist in the modeling procedure. In some cases, a transformation can 
simplify the functional form of the model; in others, it can provide an 
improved fit of the data while requiring a previously specified function form 
(e.g., a power-function model). In still other cases, the transformations may 
ensure that certain assumptions are satisfied so that a simple and valid 
analysis may be performed. For example, it may be desirable to fit the 
logarithm of a dependent variable y by unweighted least squares rather than 
fitting^ itself if the log function has constant variance. In this section, methods 
are discussed for determining transformations of the dependent variable, the 
independent variables, or both. Transformations of the parameters within a 
model were discussed in Section III. 

1. Transformations of Dependent Variable 

Theory for the transformation of the dependent variable has been pre¬ 
sented (Bll) and applied to reaction rate models (K4, K10, M8). In trans¬ 
forming the dependent variable of a model, we wish to obtain more perfectly 
(a) linearity of the model; (b) constancy of error variance, (c) normality of 
error distribution; and (d) independence of the observations to the extent 
that all are simultaneously possible. This transformation will also allow a 
simpler and more precise data analysis than would otherwise be possible. 

Let us write an integrated ath-order rate equation as 


1 

« '♦»» 

II II 

1 —t /“—S 

1 ! 

a 

^ £ 

1 _ 1 

<3 

t o 

1 )kt a # 1 

a = 1 

(115) 

If 



A = a — 1 
y = (1 — x) -1 


(116) 

then the rate equation may be written 



y w = kt 


(117) 

where 



vW /(/ - WC A o x 
“Uny 

A ^ 0 

A — 0 

(118) 
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Now, if the transformed variable is normalized to Z (A) by the Jacobian 
of the transformation, we obtain 


7W = i(y x -mf- 1 

\y In y X — 0 


(119) 


where y is the geometric mean of the y 9 s. Then it has been shown (B11, K10) 
that the steps required for effective reaction-order estimation are: 

1. Estimate, by unweighted linear least squares, the parameter b = kjj 1/N , 
which minimizes the sum of squares 

S(Q = s (Zi w - Bh) 2 (120) 

i= 1 

for a given X, and calculate the sum of squares S(X). 

2. Plot this minimum sum of squares for several X. 

3. Read off the minimum of this plot to obtain the best X, X. 

4. Calculate the 99 % confidence interval for this X by 

In S(X) - In S min < *i 2 (0.01)/iV = 6.63 /N (121) 

If this procedure is followed, then a reaction order will be obtained which is 
not masked by the effects of the error distribution of the dependent variables 
If the transformation achieves the four qualities (a-d) listed at the first of this 
section, an unweighted linear least-squares analysis may be used rigorously. 
The reaction order, a = X 4- 1, and the transformed forward rate constant, 6, 
possess all of the desirable properties of maximum likelihood estimates. 
Finally, the equivalent of the likelihood function can be represented b: the 
plot of the transformed sum of squares versus the reaction order. This 
provides not only a reliable confidence interval on the reaction order, but 
also the entire sum-of-squares curve as a function of the reaction order. 
Then, for example, one could readily determine whether any previously 
postulated reaction order can be reconciled with the available data. 

This method has been discussed in detail (K10) and extended to cover 
other important cases elsewhere (K4, M8). It can logically be asked why such 
a precise reaction order is wanted. This and several other points in the analysis 
can best be shown by the following example. 

Pannetier and Davignon (PI) studied the solid-solid reaction 


NiS 2 {s) -> NiS(j) + iS 2 (g) (122) 


They presented seven determinations of the mass of unreacted solid at each 
of four temperatures. By the method of differentiation, Pannetier and 
Davignon (PI) found the reaction to be two-thirds order. 

Typical sum-of-squares contours are shown in Fig. 22, where Z (A) re¬ 
presents the transformed dependent variable. The vertical broken lines 
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represent the 99% confidence interval. The results are shown below in com¬ 
plete form in Table XVIL Note that the two-thirds order is not compatible 
with the 415°C data. Although this order is adequate for the other data sets, 
a rather disturbing increase in the order with an increase of the temperature 
is present. In fact, the 395°C data alone would commonly be considered to 
be one-half order, with a relatively low probability of the two-thirds order 
being appropriate. This indication can be observed only for precise measures 
of the reaction order and its confidence interval. An analysis of variance for 
these data has been developed (K10). 


TABLE XVII 

Estimates of Reaction Order for Eq. (122) 


Temperature 

Reaction order (with 99% 

CO 

confidence interval) 

395 

0.53 ± 0.25 


0.61 ± 0.05 

415 

0.91 ±0.13 

426 

0.69 ± 0.09 


If a single reaction order must be selected, an examination of the 95% 
confidence intervals (not shown) indicates that the two-thirds order is a 
reasonable choice. For this order, however, estimates of the forward rate 
constants deviate somewhat from an Arrhenius relationship. Finally, some 
trend of the residuals (Section IV) of the transformed dependent variable 
with time exists for this reaction order. 

2. Transformations of the Independent Variables 

In transforming the independent variables alone, it is assumed that the 
dependent variable already has all the properties desired of it. For example, 
if the y *s are normally and independently distributed with constant variance, 
at least approximately, then any transformations such as described in Section 
VI,B,1 would be unnecessary. Under such assumptions, Box and Tidwell 
(B17) have shown how to transform the independent variables to reduce a 
fitted linear function to its simplest form. For example, a function that has 
been empirically fitted by 

y = b 0 + b x x + b 2 x 2 (123) 

might be fitted more simply by 


y = b In x 


( 124 ) 
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Fig. 22. Sum of squares curve for 395°C NiS 2 decomposition. 


if the experimenter knew that it were preferable to transform x to In x. 
A type of transformation explored in detail is 


r f — X V a i ^ 0 

l \ = lnx* a, = 0 


(125) 


This transformation thus includes the familiar reciprocal, square-root, and 
logarithmic transformations. Box and Tidwell have shown how the functions 

y^biUt (126) 

or 

y = YuY, b ij u i Uj ( 12 ?) 

can be fitted iteratively. We examine the usefulness of this approach more 
fully in Section VI,B,3, immediately following. 


3. Transformations of Independent and Dependent Variables 


Combinations of the above two methods lead to equations of the form 
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(H2, W6): 

y w = Z> 0 + £ A, U t 

(128) 

where 

f = (/■ - i)M i # o 

( = lnj 2 = 0 

(129) 

and Ui is given by Eq. (125). Here, if y is taken to be the reaction rate, then 
any rate expression from 


r = k PV 

i - i 

(130) 

to 

r = t biPV 

i 

(131) 

may be determined. If, on the other hand, 


y = 

reaction driving force 

r 

(132) 


then, clearly, Eq. (128) can represent any of the hyperbolic models. 

As an illustration, consider the hydrogenation of propylene over a 
platinum alumina catalyst discussed in Section II. These date were taken 
from 0 to 35°C, 1 to 4 atm total pressure, and 0 to 45 % propylene. Equations 
(7) and (8) were obtained after considerable sifting and winnowing of rate 
equations. Both fit the observed data reasonably well. 

An entirely empirical but very rapid technique is to fit 

r w = b 0 + b lP V + b 2 p? + & 3 T a> (133) 

where p t and p 2 are partial pressures of hydrogen and propylene, respectively. 
By first using nonlinear least squares to estimate the parameters on the right- 
hand side of Eq. (133) at each of several values of A, it was found that a y and 
a 2 should be zero, but that the values X and a 3 were poorly defined. Con¬ 
sequently, the sums-of-squares contours similar to that of Fig. 22 were 
plotted as shown in Fig. 23 at the 95% confidence level. It is apparent that 
the effect of temperature on these data is not well defined, because a wide 
range of values of a 3 represents the data as well as the minimum sum-of- 
squares value. Choosing convenient values within the 95% confidence 
region, we obtain the rate equation 

In r = 19.55 + 0.692 In p y - 0.226 In p 2 - 6.41 x 10 3 T’ 1 (134) 

or 

^aoP?’ 692 ^ 0 ’ 226 exp (-6.41 x lO 3 ^ 1 ) (135) 
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A comparison of the predicted rates of Eqs. (7) and (135) is shown in Fig. 24. 
Since the bulk of the data were taken above 5% propylene, it is apparent that 
both models fit reasonably well. However, at lower concentrations of propy¬ 
lene, Eq. (135) will deviate widely from the data. Undoubtedly, deviations 
will also occur when extrapolating other directions from the data base as 
well. Checks on the adequacy of the transformation and calculations of the 
confidence regions for all parameters may also be carried out (B11, B17, H2). 

The advantage of the approach is thus seen to be the speed and complete¬ 
ness with which a large number of empirical models can be tested and the 
best model of the group selected for closer examination. This empirical 
model may then be used as an end result, or it can serve as a starting point 
for more exhaustive analysis such as described in Sections III, V, and VII. 

C. Empirical Model Tuning 

In the correlation of kinetic data, one may spend considerable time and 
effort obtaining a theoretical model using the techniques presented in the 
accompanying sections. Alternatively, one may simply fit an empirical 
function to the data, using the several techniques already discussed in this 
section. Many cases between these extremes are met in practice however. 
This subsection discusses procedures for empirically modifying an approxi¬ 
mate mechanistic model such that (a) the function form of the mechanistic 



Fig. 23, Approximate 95 % confidence region in A — a 3 space for Eq. (133). 
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model can be retained; (b) the resulting model can describe the main features 
of the existing data; and (c) the model can possess a minimum number of 
parameters. This is done using the parametric residuals of Section IV,B,6, 
and already exploited in Section V. 

Mickley, Sherwood, and Reed (M9) have discussed a useful method of 
empirically modifying a function to improve its fit of experimental data. 
The method would be applied when a particular rate equation 

>Wi(x;K) (136) 

does not adequately describe the observed reaction rates, y. The procedure 
is to fit the residual of the dependent variable as an empirical function of the 
independent variables. If this empirical function were chosen to be linear, 
for example, we would estimate the empirical parameters B in 

y - r x = xB (137) 

After estimating the parameters B, the resulting equation 

r \ =/i(x; K) + xB (138) 

may fit the data better than does Eq. (136). There are obvious disadvantages 
to this procedure for kinetic modeling. For example, we lose the advantage 
of the use of the functional form f t (x; K), which would generally be selected 



Fig. 24. Predicted reaction rates for Eqs. (7) and (135). 
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to provide reliable interpolation of kinetic data [e.g., Eqs. (1) or (2)]. Also, 
we add additional empirical constants B to Eq. (136), so that the residual 
mean square may actually be increased by following this procedure. Let us 
now describe how this general procedure may be utilized using the intrinsic 
parameters. In this way, the main behavior of, say, the Hougen-Watson 
model can be retained while making slight empirical modifications needed 
to fit the data. Also, the empirical modifications can be made without necess¬ 
arily increasing the number of parameters in the model. 

Let us consider the data taken by Laible (LI) on the dehydration of normal 
hexyl alcohol at 450°F over a silica alumina catalyst. The single- and dual¬ 
site surface reaction controlled models applying to alcohol dehydration were 
discussed in Section V,A,2. We now consider, however, the functional forms 
given, for example, by Eq. (84), as probably being capable of describing the 
data, but do not restrict the and C 2 plots to a linear pressure dependence 
as before. Rather, we obtain an empirical pressure dependence from the 


n-Hexyl Alcohol 



Fig. 25. Plot of parameter C x versus pressure for /i-hexyl alcohol dehydration, Eqs. 
(80) and (81). 
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data. Figure 25 presents the single- and dual-site C 1 plots; it can be seen that 
either model is well described by 

£! = a + B x n (139) 


The C 2 plots of Fig. 26, however, indicate that a linear C 2 dependence upon 
pressure will not be adequate. The use of the dual-site model will require three 
additional parameters for C 2 , in order that the model approximately describe 
the data. The single-site model, however, can be used if we take 

C 2 =a + B 2 n 2 (140) 

where the parameter a is contained in both Eqs. (139) and (140). Thus, a 
model with only three parameters, which describes the main features of the 
data, is (K4) 


(1 — x)n 

(3 + 15;r) + (3 + 15.57 c 2 )x 


(141) 


Although we have intentionally chosen a simple example here to illustrate 
the method, we note that the same result could be obtained by a regression 
of any of the parameters of Sections IV,B or V on the independent variables. 
This procedure is then analogous to that described by Eqs. (136)—(138) 
except with the parameters as dependent variables. 



Fig. 26. Plot of parameter C 2 versus pressure for «-hexyl alcohol dehydration, Eqs. 
(80 and (81). 
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VII. Experimental Designs for Modeling 

The importance of a sound experimental design in modeling has already 
been emphasized. Normally, however, one cannot construct a detailed plan 
for the conduct of an entire future experimental program. At the outset of 
many kinetics experiments, we are not only unaware of the ranges over which 
certain variables should be studied but also of which variables should be 
studied. Often, it is only after observing unexpectedly wide data scatter or 
unacceptably rapid catalyst deactivation that we begin to appreciate the 
importance of certain variables. Consequently, we generally take a limited 
set of data and use the results to guide our next experiments. 

Box recognized that not only do most kinetics experiments proceed in 
such a sequential fashion, but indeed most scientific experiments proceed 
iteratively. In his descriptions of the iterative nature of experimentation 
(B9), he has identified the stages of conjecture , design , experiment , and 
analysis . In such experimentation, one designs experiments to test a certain 
conjecture, conducts the experiment and analyzes the data. From this 
analysis, one gains information leading to new conjectures. The sequence is 
then repeated, making full use of all the information available at any given 
stage in planning experiments for the next stage. This strategy is particularly 
valuable in kinetic modeling studies. 

It is partly the fault of statistics that experimenters have misconstrued 
the value of the number and precision of data points relative to the value of 
the location of the points. The importance of the location of the data in the 
model specification stage can be seen from Fig. 1, which represents literature 
data (M3) on sulfur dioxide oxidation. The dashed and solid lines represent 
the predicted rates of two rival models, and the points are the results of two 
series of experimental runs. It can be seen that neither a greater number of 
experimental points nor data of greater precision will be of major assistance 
in discriminating between the two rival models, if data are restricted to the 
total pressure range from 2 to 10 atm. These data simply do not place the 
models in jeopardy, as would data below 2 atm and greater than 10 atm total 
pressure. This is presumably the problem in the water-gas shift reaction, 
which is classical in terms of the number of models proposed, each of which 
adequately represent given sets of data. 

The proper location of data is also important in parameter-estimation 
situations. For the nitric oxide reduction reaction (K11), for example, the 
relative sizes of the three-dimensional confidence regions calculated after 
each observation are shown in Fig. 27. The size of the confidence region after 
12 points taken according to a one factor at a time variation of hydrogen 
and nitric oxide partial pressures is seen to be equivalent the size of the region 
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after a 2 2 factorial followed by one point taken by a minimum-volume design. 
The size of the region would have been substantially smaller had the first 
three points been taken by a Box-Lucas design (B16), followed by the 
minimum-volume design for the next two points. Note, from Fig. 27, that 
18 times more information is obtained from 12 minimum-volume design 
points than from the 12 one-factor design points. This is a very significant 
increase in information; it has been shown, for example, that an increase in 
the precision of an estimated rate constant by only a factor of 4 can reduce 
the reactor overdesign required for such uncertainties by 15% of the reactor 
volume (K6). 

Experimental design procedures exist for discriminating among m rival 
models or estimating parameters in kinetic models. Figure 28 indicates three 
degrees of sophistication of such techniques that are available. At one 
extreme, the experimenter can simply use his intuition and experience, for 
example in suggesting that a tenfold variation from 1 to 10 atm total pressure 
should be adequate. Or, perhaps, high and low concentrations may be used. 
It is an unfortunate result of human nature that such experimental designs 



Fig. 27. Relative size of confidence region versus experiment number for nitric oxide 
reduction, Eq. (149). 
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may be influenced more by convenience than necessity; in any event, this 
method is best utilized by a gifted few. In very simple situations, such as the 
case in Fig. 1 with two models and one independent variable, a few hand 
calculations will suggest the proper regions of experimentation. For this 
example, given the experimental data of the figure and the scientific knowledge 
of two rival models, parameter estimates could be obtained and the predicted 
reaction rates sketched as shown in Fig. 1. These hand calculations, then, 
suggest that data at higher and lower pressure would be informative. The 
more usual situation, however, is where several complex models exist, each 
being influenced by several independent variables. In such cases, computer 
studies can best consider all of the available data and scientific knowledge, 
to evaluate the numerous possible alternatives. 

The general scheme required here is shown in Fig. 29, a generalization of 
Fig. 28. With some ideas as to good areas of experimentation, the experi¬ 
menter takes an initial set of data. These data are then analyzed to determine 
the best estimates of the parameters of the model or models under considera¬ 
tion. Since models that usually arise in these circumstances are nonlinear in 
the parameters, some version of nonlinear estimation will usually be em¬ 
ployed in this analysis. Nonlinear estimation techniques, of course, almost 
always require the use of a computer. 

Following this data analysis, the next step is the selection of a further 
set of runs to be performed; this also usually requires the use of a computer. 
For the two areas of modeling being discussed here, model discrimination 
and parameter estimation, this step involves maximization of the design 
criterion appropriate for each case. Settings of the independent variables, 
which extremize the criterion, are therefore determined, and these settings 
constitute the experimental design according to which the next set of runs 



Fig. 28. General methods of selecting model discrimination or parameter estimation 
designs. 
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are performed. Once the data are collected, reestimation of the parameters 
follows. This cycle can be continued a sufficient number of times to accom¬ 
plish the modeling objective (either model discrimination or parameter 
estimation). The strategy makes heavy use of the computer, there being a 
continual iteration between computer and laboratory. 


A. Model-Discrimination Designs 

Suppose that a given reacting system is about to be studied for which 
there exists a number of rival models; the object of the experimentation is 
here presumed to be the elimination of the inadequate models. Through 
this series of experiments, then, the experimenter is attempting to arrive 
at the best mathematical model for his system. The original paper (HI3) 
in this area suggested sequential designs for discriminating between m — 2 
models. Box and Hill (B13) later showed how to discriminate between an 
arbitrary number of rival models. In this later generalized approach, the 



Fig. 29. Experimental strategy for mechanistic modeling. 
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experimenter iteratively selects those experimental conditions for the next 
stage of experimentation according to the following criterion: 


D = 'Z Z n J D u 

i= 1 J= 1 


(142) 


where 


*41 


(& 2 - tj 2 ) 2 


(a 2 + a 2 )(a 2 + &/) 


+ (h - 9j) : 


l 


A - + 


1 


V + Z? o + &/) J 


The criterion D is a measure of divergence among the models, obtained 
from information theory. The quantity 7i t is the prior probability associated 
with model i after the «th observation is obtained; a 2 is the common variance 
of the n observations y (\), y{2 ), •••, y(n — 1), y(«); 6 2 is the variance for 
the predicted value of y(n + 1) by model i. When we have two models, D 
simplifies to 


D = 


(& 2 2 - ^l 2 ) 


(tx 2 + & t 2 )(a 2 + & 2 2 ) 


+ (j)l " h)2 {^TJ? + ^Ta?) (143) 


Note from Eq. (142) that this discriminant is large when y ( is far from $ j9 
& 2 and &/ are small, and n t and 7ij are jointly large. This means that in 
order to decide which of several models is adequate, we should seek out 
settings of the independent variables for which the responses of the various 
models are expected to be quite different [(j) f — yj) large] and for which the 
responses can be predicted relatively precisely (a 2 small). Also, we should 
not give much weight to models that we suspect are relatively poor ( 7 Tij 
small), even though the two models predict responses far from one another. 
This discrimination procedure, then, utilizes all of these concepts to provide 
effective model discrimination designs. In particular, after N observations 
become available, the quantity D will be maximized with respect to the 
settings for the next experimental run, N + 1, and these maximizing values 
will be used for the (N 4- l)th experiment. This procedure could be con¬ 
tinued until one model was found to be clearly superior to the others. This 
iterative discrimination procedure can thus be represented by Fig. 29, where 
the appropriate design criterion is taken to be, at each stage, the maximization 
of D as defined in Eq. (142). 

As an illustration of the power of the method, two models known to fit 
the propylene hydrogenation data are shown in Eqs. (7) and (8). (See also 
Sections II,B and VI,B,3). Let us denote Eq. (7) by model 1 and Eq. (8) by 
model 2. 

An initial nine data points were taken at 35°C in an adiabatic flow reactor. 
The initial prior probabilities were taken to be equal (equal probability of 
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adequacy) and cr 2 was estimated from experimental data (H12). Because of 
rapid catalyst deactivation and equipment limitations, total pressure was 
maintained at or below 3 atm. 

Figure 30 portrays the grid of values of the independent variables over 
which values of D were calculated to choose experimental points after the 
initial nine. The additional five points chosen are also shown in Fig. 30. 
Note that points at high hydrogen and low propylene partial pressures are 
required. Figure 31 shows the posterior probabilities associated with each 
model. The acceptability of model 2 declines rapidly as data are taken 
according to the model-discrimination design. If, in addition, model 2 cannot 
pass standard lack-of-fit tests, residual plots, and other tests of model 
adequacy, then it should be rejected. Similarly, model 1 should be shown to 
remain adequate after these tests. Many more data points than these 14 
have shown less conclusive results, when this procedure is not used for this 
experimental system. 

For the multiresponse situation, several measurable responses are implicit 
in each model under consideration. For example, for the reaction 


A 


-»C 


several models could be postulated, one of which could be that each com¬ 
ponent undergoes a first-order decomposition. For each such model, several 
responses could be measured as a function of time, e.g., the concentrations 
of components A, B, C, and D. Just as there is substantial advantage in 
making use of each of these responses in parameter estimation (Section 
III,D), considerable information about model discrimination will be lost 
unless all of the possible responses are utilized. The model-discrimination 
procedure for several responses has been developed (H10) and exemplified 
(H14). 


B. Parameter-Estimation Designs 

Once model discrimination has been accomplished (that is, from a 
large group of rival models one best model has been selected as being ade¬ 
quate), further experimentation can be conducted to improve parameter 
estimates. For such parameter-estimation experiments, one design criterion 
suggested is to choose experiments providing the most desirable posterior 
distribution of the parameters. Under certain assumptions (B15), the proce¬ 
dure reduces to the maximization of the determinant 


A - |X'X| 


(144) 
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where the (/, w)th element of this N x pX matrix is 

5/(x„;K| 


f'iu = 


dK, 


K = K° 


It is to be noticed that for linear models, such as 

/(x u ;K) = K 1 Xi b + K 2 x| u 


(145) 

(146) 


then Eq. (145) becomes 

f lu -^lu 
f'iu = xiu 

The determinant is then, for N data points, 


A = 


£*?« ZXiuXlu 


M=1 


«= 1 


JV N 

Z X 2u 

U - 1 u - 1 


(147) 


(148) 


For this linear model, then, we simply find values of and x 2 that maximize 
A. We do not need estimates of the parameters to define this minimum 
variance design. For nonlinear models, such as Eq. (40), this is not true. The 



Fig. 30. Settings of independent variables for propylene hydrogenation for discrimin¬ 
ating between Eqs. (7) and (8). 
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criterion A will be a function of the parameter values and, hence, the optimal 
design depends on these values; initial parameter estimates must be pro¬ 
vided to use this criterion (B16). 

By maximizing the determinant A, we can expect to have a smaller con¬ 
fidence region of the parameter estimates than that obtained with any other 
experiment in the possible region of experimentation. The larger the con¬ 
fidence region, of course, the more uncertain is our knowledge concerning 
the estimated parameters; the smaller the confidence region, the more precise 
is our knowledge. 

The iterative parameter-estimation procedure can also be represented as 
in Fig. 29. An experimenter performs an experiment, obtains the data and 
estimates the parameters, for example, by nonlinear least squares. Then he 
checks the confidence region of the parameter estimates, perhaps through 
A as shown in Fig. 27. If it is too large, the experimenter finds those experi¬ 
mental settings that maximize the parameter estimation criterion of Eq. (144), 
that is, experimental settings that will reduce the size of confidence region as 


n, =o.5 n 2 =o.5 



After II Runs II, =0.25 I7 2 =0.75 

1 _ 

I 2 


After 12 Runs 


After 13 Runs 



Model 2 Model I 


Fig. 31. Posterior probabilities in discrimination of propylene hydrogenation models 
of Eqs. (7) and (8). 



n, = 0.05 n 2 =o.95 
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much as possible. He then conducts an experiment as indicated by this 
experimental design, analyzes the data, and checks whether the parameters 
have now been estimated with sufficient precision. If they have, he terminates 
the experimentation; if not, he repeats the cycle. 

To illustrate this criterion, consider a simulation problem chosen to be 
similar to the catalytic reduction of nitric oxide (A3, K11): 

N0 + H 2 -H 2 0 + iN 2 

In an experimental study (A3), the reaction model was found to be the surface 
reaction between an adsorbed nitric oxide molecule and one adjacentlyl 
adsorbed hydrogen molecule: 


kK^ 0 K H2 p N0 p Hz 
(1 + X NO p NO + X H2 p H2 ) 2 


(149) 


The confidence region obtained using a simulated one variable at a time 
design was first examined, since this was the design used by the original 
experimenters. 

The approximate 95% confidence region for the non-linear least-squares 
estimates using these data is shown in Fig. 32. The surface represented in this 
figure is the contour of the sums of squares surface, which has the value S c 
given by Eq. (56). 

In order to obtain more precise estimates of these parameters, the experi¬ 
mental design was also constructed using the above parameter estimation 
criterion. Using four preliminary observations taken according to a factorial 
design, the three parameters ( k , X NO , and K Hl ) were estimated. A fifth point 
was selected to minimize the joint confidence region of these parameters, 
using the settings of the partial pressures maximizing A. Following Fig. 29, 
the simulated reaction rate at this fifth point was used to reestimate the 
parameters by nonlinear least squares. Then the sixth point was chosen by 
again maximizing the determinant A, and the entire procedure was repeated. 

Since the square root of the determinant A is inversely related to the size 
of the confidence region, the rate of decrease in volume obtained by this 
design procedure may be represented by Fig. 27. The solid horizontal line 
is the size of the region obtained by the twelve one variable at a time experi¬ 
mental points. In contrast, the estimates from the minimum-volume design 
are equally precise after the fifth point (i.e., the first point chosen by the 
minimum volume design). A visual measure of the size of the confidence 
region after twelve well-designed points can be obtained from Fig. 33, 
which is one-eighteenth the size of the one variable at a time confidence 
region of Fig. 32. 

This parameter-estimation technique has also been extended to the 
multiple-response case (D3). Just as was seen in the multiple-response 
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Fig. 33. Approximate 95% confidence region for Eq. (149) and sequential minimum 
volume design. 
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parameter-estimation example (B12), much greater power is available to the 
experimenter when all available responses are used. Also, a method has been 
suggested (H4) for combining the model-discrimination and parameter- 
estimation criteria, in order to increase the efficiency of the overall modeling 
program. 

It will be noticed from Fig. 29 that there is a continual iteration between 
the laboratory and the computer. In the laboratory area, there could be a 
remote console linked directly to a time-shared digital computer, although 
immediate laboratory access is not essential. After obtaining his results, the 
experimenter would enter these into the digital computer, and the computer 
would perform routine analyses that have been programmed. For example, 
least-squares estimates would be calculated and the confidence region deter¬ 
mined, for each model. Graphical output in the form of various plots of the 
residuals could also be obtained and examined. The computer could then 
be asked to determine those settings for a further run that would extremize 
the appropriate design criterion. The computer, having executed this instruc¬ 
tion, would print out these optimal settings for consideration by the experi¬ 
menter. The experimenter, in turn, could use these settings to proceed with 
further experimentation, or ask for other possible settings. 

There is danger in describing this procedure in such an oversimplified 
way; it could be misconstrued that the investigator may only set variables 
and run experiments as the computer dictates. Experimentation in kinetics 
is far too complex for any such naive approach as this. In computer-assisted 
sequential experimentation, it is still necessary that the experimenter provide 
new ideas, intuition, and insights, which could possibly change the direction 
of the investigation in a fundamental manner. In short, the experimentation 
cannot be completely programmed ahead of time. Natural discoveries and 
surprises are the essence of research, and any experimental strategies for 
planning experiments must take them into account. However, the continual 
iteration between computer and laboratory, while not solving the problems 
completely, can contribute much in the way of solving modeling problems. 


Nomenclature 


A. 

A component of a reacting 
mixture 

Bi 

An empirical parameter in 
Eqs. (139) and (140), / = 1, 2 

a 

A reaction order to be esti¬ 
mated for component A 

B it 

The canonical parameters de¬ 
fining the nature of the re¬ 

at 

An empirical exponent of 

Eq. (133) 


sponse surface in trans¬ 
formed coordinates, 

d 

An empirical parameter in 


/-l,2, 3,— 


Eqs. (139) and (140) 

B 

A vector of empirical para¬ 

B 

A component of a reacting 
mixture 


meters of Eqs. (137) and (138), 
obtained by linear regression 
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b 
b , 
b 

b 

b 

b 

C 

Ct 


C, 


Ci 


c ., 

c, 

Cao 

C L 

C Ai 


c 


cov(6i, b 2 ) 
D 


of residuals on x 
A reaction order to be esti¬ 
mated for component B 
An estimated value of a para¬ 
meter 

A combination of rate and 
absorption constants, defined 
by Eqs. (94), (95), and (99) 

A transformed rate constant 
for Eq. (115), obtained by 
dividing the rate constant by 

Jl/N 

Least-squares estimate of a 
parameter 

A vector of parameters to be 
estimated 

Designates a component of a 
reacting mixture 
An observed value of an 
intrinsic parameter, which 
collects all denominator terms 
of hyperbolic models not mul¬ 
tiplied by conversion 
The value of predicted by 
an assumed model 
An observed value of an 
intrinsic parameter, which 
collects all denominator terms 
of hyperbolic models multi¬ 
plied by conversion 
The value of C 2 predicted by 
an assumed model 
Concentration of any reactant 
A 

Concentration of any reactant 
A at zero time 

Molar concentration of vacant 
active sites on catalyst 
Concentration of any com¬ 
ponent A at the ith data point, 
i = 1, 2, • • •, JV 
Concentration of component 
A at ith data point predicted 
by a particular model 
Variance-covariance matrix 
of observation errors, defined 
in Eq. (39) 

Covariance of two para¬ 
meters, b t and b 2 
Model discrimination criter¬ 
ion defined in Eq. (142) 


Du 

E 

E' 

EiCi-Ci) 


F 

flu 


I 

J 

K 


K t 


Ei 
K <‘) 


K 

K° 

k 

k' 

Jc 

k Q 


ko 


L 


Quantity related to the diver¬ 
gence D defined in Eq. (143) 
Activation energy 
Ratio of activation energy to 
universal gas constant, E/R 
Expected value of the residual 
of Ci 

The 100(1 —a) % point of 
the F statistic with (v iy v 2 ) 
degrees of freedom 
Mass feed rate to reactor 
Partial derivative of nonlinear 
rate model with respect to the 
ith parameter and evaluated 
at the «th experimental point 
Matrix of unity diagonal 
terms and zero off-diagonal 
terms 

Jacobian of the transforma¬ 
tion of the variable y to the 
variable y U) 

Thermodynamic equilibrium 
constant for over-all reaction 
Adsorption equilibrium con¬ 
stant for species /, or pro¬ 
duct of absorption constant 
and initial mole fraction of 
species i 

Adsorption equilibrium con¬ 
stant for species i 
Estimate of a parameter after 
ith iteration of nonlinear 
estimation program 
Vector of estimated rate and 
adsorption parameters 
Vector of initial estimates of 
K for use in nonlinear least 
squares 

Forward reaction-rate con¬ 
stant 

Reparameterized forward re¬ 
action-rate constant 
Forward reaction-rate con¬ 
stant 

Preexponential factor of for¬ 
ward rate constant 
Reparameterized preexpon¬ 
ential factor of forward rate 
constant 

Total concentration of active 
sites on catalyst 
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m Number of rival models 
N Number of experimental ob¬ 
servations 

n Exponent of denominator of 
generalized hyperbolic rate 
equation, such as Eq. (94), or 
the reciprocal thereof 
n Number of replicated data 
sets 

p Number of parameters in a 
model 

p t Partial pressure of species i in 
reaction mixture 
q Number of replicated points, 
used to estimate experimental 
error 

R Universal gas constant 
r Reaction rate 
r t Reaction rate predicted from 
model / 

r 0 Initial reaction rate 
r s Reaction rate at the station¬ 
ary point of the reaction-rate 
surface 

r„ v Reaction rate predicted from 
response v at the wth experi¬ 
mental point 

r w Transformed reaction rate, 
where the transformation is 
defined generally in Eq. (129) 
S Sum of squares of residuals, 
defined in Eq. (19) 

S c Critical sums of squares con¬ 
tour giving a 100(1 —a) % 
confidence region 

S m m Minimum sum of squares on 
a sum-of-squares surface, cal¬ 
culated by introducing least- 
squares parameter estimates 
into Eq. (19) 

S r Multiple-response sum of 
squares defined by Eq. (64) 

S A Multiple-response sum of 
squares defined by Eq. (65). 

5(A) Minimum sum of squares of 
residuals for a given value of 
A, defined by Eq. (120) 
s 2 Estimate of experimental 
error variance 

sq Standard error of parameter 
estimate 6 


T Absolute temperature 
T Mean value of absolute tem¬ 
peratures of a particular set 
of data 

t Measure of reaction time 
t 1/2 Half-life of a reaction 
( 0t « The 100(1 — a/2)% point of 
the t distribution with v 
degrees of freedom 
U t Transformed dependent vari¬ 
able defined by Eq. (125) 
v(bi) Variance of the parameter 
estimate b t 

v(b) Variance-covariance matrix 
of the vector of parameter 
estimates b 

v Designates the response num¬ 
ber in Eq. (65) 

W Mass of catalyst in reactor 
Xt The canonical axis /, 

*= 1 , 2 , 3 , • 

X A matrix of independent vari¬ 
ables, defined for a linear 
model in Eq. (29) and for 
nonlinear models in Eq. (43) 
X 1 Transpose of matrix X 
X” 1 Inverse of the matrix X 
x Conversion in reactor 
x t A generalized independent 
variable 

x t Predicted value of the reactor 
conversion, for model i 
x A matrix of the independent 
variables x t 

y A vector of the independent 
variables y t 

y Dependent variable related to 
conversion by Eq. (116) 
y t Generalized dependent vari¬ 
able 

y Geometric mean of any set of 
variables y t 

y Arithmetic mean of any set of 
variables y, 

y A o Initial mole fraction of com¬ 
ponent A 

y t Predicted value of the gen¬ 
eralized dependent variable 
yt 

y A) Transformation of y , given 
by Eq. (118) 



MATHEMATICAL MODELING OF CHEMICAL REACTIONS 


181 


y” 

Observed value of the re¬ 

A 

A transforming parameter re¬ 


sponse v at the nth experi¬ 


lated to the reaction order 


mental point 

V 

Number of degrees of freedom 

z 

Dependent variable defined 

IT 

Total pressure 


by Eq. (77) 

TTl 

Posterior probability asso¬ 

z 

Predicted value of z 


ciated with model i 

Z (A) 

Normalized transformed de¬ 

o 2 

Experimental error variance 


pendent variable given by 

o 2 

Experimental error variance 


Eq. (119) 


for ith experimental point 

a 

Confidence level of F or t 

O r 2 

Variance of the reaction rate, 


distributions 


r 

Pi 

True value of an/th parameter 

o 2 

Variance of the predicted rate 

P 

Vector of parameter values j3 t 


from model i 

Yi 

Rate constants of Eq. (7), 

Xi 2 («) 

The 100(1 —a) % point of 


* = 1,2 


the chi-squared distribution 

A 

Determinant defined in Eq. 


with 1 degree of freedom 


(65) 


A parameter setting the rela¬ 

e 

Experimental error 


tive contributions of the 

8 

Estimated value of any para¬ 


linearization and steepest 


meter 


descent methods in deter¬ 

A 

A nonintrinsic parameter de¬ 


mining the correction vector 


fined in Eqs. (77) and (78) 


b of Eq. (45) 
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I. Introduction 

The use in recent years of mathematical models of quite large scale physical 
processes has confronted the engineer with the task of solving large systems 
of equations. The system of n real equations represented formally as 

x 2 ,...x„) = 0 

f 2 (x 1 ,x 2 ,...x n ) = 0 .J) 

fn(x i, X 2 , .. . X n ) = 0 

in which one or several of the equations may describe one piece of equipment 
can almost never be solved in closed form. If some of the equations are non¬ 
linear, iterative algorithms of various kinds are employed to obtain a numeric¬ 
al solution of the system equations. These algorithms generally start with an 
initial vector of all the variables, x , and they employ different strategies to 
modify x so as to converge to a solution by sequential stages. Typical algo¬ 
rithms are Newton’s iteration function (Tl) which employs first derivatives 
of the system equations to modify x ; the method of steepest descent (K2), 
which also employs first derivatives of the equations to modify x so that 

i ft\*) 

i= 1 

is minimized; the method of direct search (H3), which employs a search 
strategy for modifying x , and iteration functions with memory (Tl), which 
employs information from the most recent interations to modify x. Many 
other iterative methods have been described including Rosen (Rl), Nagiev 
(Nl), Vela (VI), and Komatsu (Kl). 

For a process that can be represented by a set of system equations of 
modest size, say from one to 100, the procedures mentioned above are reason¬ 
ably straightforward and efficient. However, as the number of equations in 
the system further increases, these methods run into difficulties because of 
the large digital computer storage requirements and the excessive amount of 
computer time needed to solve the entire system of equations simultaneously. 
Analysis of the system of equations before a solution is attempted can often 
substantially mitigate these two problems. Experience has demonstrated that 
a large system of equations that represents a physical process need not be 
solved simultaneously but can be broken up into subsystems in which only 
smaller sets of equations need be solved simultaneously. Further, if the sys¬ 
tem is ordered into a sequence such that each equation or various groups of 
equations can be solved independently of the remaining equations in the 
sequence, the solution can proceed serially, and the computer storage needed 
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to effect a solution would correspond roughly to the size of the largest subsys¬ 
tem of equations instead of the entire system. The time for solution would 
also be substantially reduced. Finally, if the equations within each parti¬ 
tioned subsystem can be ordered so that the number of iterates that need be 
chosen to obtain a solution of the subsystem is smaller than the number of 
equations in the subsystem, the computer time needed to solve the entire 
system may be reduced even further. Therefore, a systematic strategy to iden¬ 
tify such subsystems, to order the equations and subsystems in the proper 
sequence, and to order the equations within the subsystems would have con¬ 
siderable merit. This type of precedence ordering will be called a decompo¬ 
sition of the system. 

Decomposition leads to a rearrangement of the process equations from 
their flow chart sequence to a natural sequence based on the information flow 
among the equations. The ultimate goal is to set up an iterative scheme in 
which each equation is solved for a single variable (by some appropriate root 
identification method), and where values of unknown variables that must be 
assumed are checked cyclically. The greatest reduction in the number of 
iterates that must be assumed, and therefore the greatest reduction in com¬ 
puter storage and time requirements, takes place for those systems of 
process equations in which the number of variables per equation is small 
compared to the total number of variables in the system. Clearly, when 
each of the system equations contains every process variable, no effective 
decomposition can take place. Fortunately, most models used in the process 
industries are of such a character that extensive decomposition can be effected. 

Decomposition can be applied to models that represent any large scale 
integrated process whether formed for solving material balances, energy 
balances, and equilibrium relations in process design, or used for economic 
evaluation, or for analysis of the control of entire plants. One of the most 
fruitful applications is to digital executive programs such as pacer (S2), 
which can be used to design chemical plants in the sense that the parts of the 
plant can be assembled in the desired sequence and the internal subroutines 
used in design. Selection of a good precedence order for calculation among 
the process units can improve materially the operation of the simulation. 
Another application of decomposition, which has not been attempted yet but 
which appears promising, is in the optimization of large plants where the 
model includes many equality constraints that can be decomposed. Industrial 
scheduling and production planning as well as network analysis are areas of 
application outside of the process industries. 

We first describe in Section II how the information flow takes place in 
process models, give a compact method of representation of the system of 
equations, and point out the correspondence between a system of equations 
and a linear diagraph. In Section III, methods for finding an output set 
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(assignment of a dependent variable to each equation) is presented. Section IV 
presents procedures for partitioning the system into the smallest irreducible 
subsystems (groups of equations that must be solved simultaneously). 
Section V examines the concept of disjoint subsystems and the methods for 
finding them. Section VI examines the irreducible subsystems and discusses 
their internal ordering. Various methods of decomposition are discussed and 
compared in Section VII. Section VIII presents a large scale example of de¬ 
composition based on a digital computer program listed in Appendix A. 

Certain special problems related to decomposition are also considered. 
One is whether the process model is determinate, that is, does the model have 
a solution. Section III indicates how decomposition can help in validating 
determinacy. Another problem is that of convergence of the iterative strategy. 
If certain of the equations are too sensitive to the values of the variables being 
iterated, convergence may not be obtained. Therefore, the decomposition 
procedure must be constrained so as to choose as iterates only those variables 
for which the system of equations has suitable sensitivity, as discussed in 
Section VI. 


II. Information Flow in Process Models 

The information flow among the system equations can be represented by 
information flow diagrams in a fashion roughly analogous to the represen¬ 
tation of the process material flows by a process flow sheet. Typical examples 
of the graphical representation of information flow are organizational charts 
and signal flow diagrams. Associated with the information flow graphs are 
Boolean matrices, which have a one to one correspondence with the structure 
of the graph, but can easily be loaded into a digital computer for analysis. 
Decomposition of a large system can take place using either the graph or the 
associated Boolean matrix, but we are concerned primarily with the latter 
because the decomposition of a large graph by inspection would be imprac¬ 
tical. However, decomposition by inspection of an information flow diagram 
is quite feasible for a small number of system equations say less than 25. 

A. Graphs and Boolean Matrices 

We now examine some of the properties of graphs and their associated 
Boolean matrices (HI, H2), which aid in decomposition. A graph, G, is a 
collection of vertices, V (nodes, points, elements, subsystems) and lines joining 
the vertices called edges, E (flows, paths, streams). A digraph or directed 
graph is a graph in which the edges are directed (flow may occur in only one 
direction). A finite graph is one which has a finite number of vertices. Associa- 
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ted with each graph is a Boolean matrix R termed the adjacency matrix 
(associated matrix, relation matrix, structural matrix), which has as many 
rows and columns as the graph has vertices (the numbering of both the rows 
and columns correspond to the numbering of the vertices). An element 
r ij = 1 if there is a flow directed along an edge from vertex j to vertex /; 
r u = 0 otherwise. Thus, there is a one to one correspondence between a graph 
and its adjacency matrix. 

Figure 1 illustrates a digraph. The rows of the adjacency matrix corre¬ 
spond to the vertices to which flow is directed. The columns correspond to 
the vertices from which the flows are directed. 

Vertex 
12 3 4 

ooor 

1 0 0 0 R 

10 0 0 

0 0 1 0_ 

Fig. 1. Correspondence between a graph and its adjacency matrix R. 

The edge of the graph from vertex 1 to vertex 2 indicates a flow of information 
or perhaps material from vertex 1 to 2 and therefore the element r 2l = 1. 
Similarly, there is a flow directed from vertex 1 to vertex 3 so that r 31 = 1; 
also r 43 = 1 and r 14 = 1. All other elements of the matrix are 0 since there 
are no other flows in the graph. The adjacency matrix thus indicates all of 
the one step paths (flows between two vertices without an intervening vertex) 
between vertices. If r tj = 1, there is a one-step path from vertex / to vertex /. 
Therefore, all of the structure of the graph is contained in the adjacency 
matrix. 

In the language of symbolic logic the elements r {J of the adjacency matrix 
are the propositions: “There is a direct path from vertex j to vertex i" which 
admit of but two truth values: true designated by the symbol 1, and false 
designated by the symbol 0. Any matrices with elements of which meaningful 
products and sums are defined can be manipulated by the rules of matrix 
algebra (B2, PI) to give new matrices that are, in general, meaningful. This 
is true here if the product of two or more propositions is understood to mean 
their logical or Boolean product and the sum of a number of propositions is 
understood to mean their logical sum or Boolean union: 

Boolean multiplication —If x, y, z, ... are propositions, their logical pro¬ 
duct (x H y . z . • * •) in any order is the new proposition “All of x,y,z,... are 
true.” In the symbolism here adopted, the truth value of (x . y . z . • * •) is 1 
if that of every factor is 1 but it is 0 if that of any factor is 0. Briefly, 

x . y . z m • * • = min{x, y,z ,...} 
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Boolean union —If x, y, z, ... are propositions, their Boolean union or 
logical sum (x u y u z u • • •) is the new proposition “At least one of x, y, 
z, ... is true.” Thus, the truth value of (x u y u z u * * •) is 0 if that of every 
addend is 0, but it is 1 if that of any addend is 1. Briefly, 

z u y vj z u • • * = max{x, y, z,...} 

Boolean multiplication is distributive with respect to Boolean union; 
that is, 

x „ (y u z) = (x . y) u (x . z) 

wherein the parentheses on the right-hand side are usually omitted, the under¬ 
standing being that multiplications are to be performed before unions unless 
parentheses indicate otherwise. In fact, by choosing to indicate true and false 
by 1 and 0, respectively, Boolean multiplication is conveniently made to obey 
the rules of ordinary arithmetic. 

Boolean union, on the other hand, differs significantly from addition. 
We have, for example, 


x u x = x 

Further, Boolean union is distributive with respect to Boolean multiplication; 
that is, 

x u (y . z) = (x u y) , (x u z) 

but here the parentheses may not be removed on the right without ambiguity. 
There is no meaningful analog of subtraction; consequently the solution of 
propositional equations may not, as in ordinary algebra, involve the transfer 
of addends across the equals sign. 1 

Boolean matrix multiplication is carried out according to the usual 
matrix rules, expressed in terms of the logical product and logical sum as 
defined above. 

Thus, the elements c tJ of the matrix 

C = A.B 

are found by the formula 

n 

c ij— {Ja ik mb k j (2) 

k~l 

where the operational symbol 

n 


1 The sign — in propositional equations indicates equality of the truth values of the 
propositions equated, not necessarily, their identity or equivalence. 
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indicates the union of the terms typified by the product following it with k 
taken successively as 1, 2, 3 ... n, and n is the common dimension (or order) 
of the matrices A, B, and C. For example, in the product of the matrix R of 
Fig. 1 by itself to form R 2 , the first element in the first row, c n , would be 
calculated as follows: 

c n = max[min{0, 0}; min{0, 1}; min{0, 1}; min{l, 0}] = 0 
and the first element in the fourth row, as follows: 

c 41 = max[min{0, 0}; min{0, 1}; min{l, 1}; min{0, 0}] = 1 

It will be observed that, in general, A „ B ^ B . A. 

Another operation to be used is the union of rows or columns within a 
particular matrix. If A is an n by 1 matrix (a column of an n by n matrix), 
B is an n by 1 matrix (another column), then the n by 1 matrix C = A u B is 
defined by the elements 


Ci = fljU b t for i = 1 , 2, ... n 


( 3 ) 


For example 


'O' 


T 


T 

0 

U 

0 

= 

0 

1 


1 


1 


A similar definition is applied to the union of the rows. 

An important property of the adjacency matrix is that the ith power of 
this matrix gives all the / step paths between vertices. For example, the second 
and third powers of the adjacency matrix in Fig. 1 are shown in Fig. 2. The 



1 

2 

3 

4 


1 

2 

3 

4 

1 

"0 

0 

1 

0" 

1 

'1 

0 

0 

0" 

R 2 = 2 

0 

0 

0 

0 


0 

0 

1 

0 

3 

0 

0 

0 

0 

3 

0 

0 

1 

0 

4 

.1 

0 

0 

0 

4 

0 

0 

0 

1 


Fig. 2. Second and third powers of the adjacency matrix. 

rows of R 2 and R 3 still correspond to the vertices to which the flows are 
directed, and the columns correspond to the vertices from which the flows 
are directed. Each nonzero element of the matrix K k indicates there is a path 
going through k edges (a k step path) from vertex j to vertex /. Figure 2 indi¬ 
cates that a two-step path exists from vertex l to vertex 4 and a two-step path 
exists from vertex 3 to vertex 1. R 3 indicates not only the three-step loop 
tying together vertices 7, 3, and 4 but also the path 3 4 -> 1 -► 2. A kth 

order loop in a graph is defined as a set of k vertices each of which is connected 
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to every other vertex of the loop by a closed path. A kih order maximal loop, //, 
of a graph G , is defined as a set of k vertices in G connected by a closed path 
such that every other loop in G is either contained in H or has no vertex in 
common with H, In Fig. 3, the loop 1 -► 2 1 is not maximal, whereas the 



Fig, 3. Maximal and nonmaximal loops. 

loop maximal loop. A graph G then may contain several 

maximal loops of different orders. Because these loops bind together vertices 
in a common information network, the location of these maximal loops is an 
important phase in the decomposition of systems. 

One method of locating these maximal loops is to compute the reach¬ 
ability matrix, R*(H1), which is the element by element Boolean union of all 
of the powers of the adjacency matrix up to the wth, where n is the number of 
rows of R. An element of the reachability matrix is defined as 

r *j = 0 At 

k= 1 

where r,* is the element of the ith row and the y'th column of the fcth power 
of the adjacency matrix. The reachability matrix indicates all of the paths of 
any length between vertices. Figure 4 shows the reachability matrix for the 
graph of Fig. 1. 


1 

7 P 

R*= 2 1 

3 1 

4 1 


2 3 4 

o i r 
0 0 0 
0 1 1 
0 1 1 


Fig. 4. Reachability matrix of the graph in Fig. 1. 


The maximal loops in the graph can be found from the reachability 
matrix by finding those sets of vertices that satisfy the following conditions: 
(1) r* = r* = 1, where i and j take on all possible combinations of the vertex 
numbers in the set; (2) no other vertices, not included in the set, satisfy con¬ 
dition (1). The first condition requires that each vertex in the set is reachable 
by some path from every other vertex in the set. The second condition re¬ 
quires that there is no path from a vertex in the set to a vertex outside the set 
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and back again to a vertex in the set. If a set of vertices meets these two con¬ 
ditions, there must be a closed path that goes through every vertex in the set, 
which is by definition a loop. The loop found in this manner is maximal be¬ 
cause, if there were a larger loop that included it, the second condition would 
not be satisfied. Other methods for finding maximal loops are discussed in 
Section IV. 

B. Representation of System Equations 

Since the systems of equations to be considered are quite large, it is neces¬ 
sary to use some compact method to represent the information flow among 
them. A very convenient technique is to relate the system equations to a 
digraph and its associated Boolean matrix, which represent the structure of 
the information flow in the system. The Boolean matrix to be used is called 
the occurrence matrix (HI, S3), and is defind as follows: (1) each row of the 
occurrence matrix corresponds to a system equation, and each column cor¬ 
responds to a system variable; (2) an element of the matrix, s ij9 is either a 
Boolean 1 or 0 according to the rule 

fl, if variable j appears in equation i 
lj \0, otherwise 

This matrix then indicates the occurrence of the dependent variables in each 
of the system equations. For example, consider the following set of equations 
written in functional form: 

/i(*i. *2) = o 
/2O4) = 0 
h(* 3>*e) = 0 

/ 4 (x 4 , x 5 ) = 0 
/sO u x e) = 0 
f 6 (x 2 , * 3 , *5) = 0 

Figure 5 is the occurrence matrix corresponding to Eqs. (4). 

In decomposing a system of equations, it is necessary to analyze the in¬ 
formation flow among the equations concerning the values of the system 
variables (S3). In order to determine the direction of information flow in the 
system equations, one must first establish what information each equation is 
to supply, that is, the identity of the variable whose value is to be obtained 
from the equation. Further, the system equations together must supply all of 
the information about the system (the values of all of the variables). The 
variable for which an equation is to be solved is called its output variable and 
the set of all of the variables assigned to the equations as output variables is 
called an output set (S3, H2). Thus the information that one equation can 
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Variable 



*i 

X 2 

*3 

*4 

*5 

*6 

fl 

"i 

1 

0 

0 

0 

O' 

fl 

0 

0 

0 

1 

0 

0 

Equation / 3 

0 

0 

1 

0 

0 

1 

u 

0 

0 

0 

1 

1 

0 

fs 

1 

0 

0 

0 

0 

1 

fe 

.0 

1 

1 

0 

1 

0. 


Fig. 5. Occurrence matrix for Eqs. (4). 


communicate to the other equations is the value of its output variable. 
Simultaneously, information flows into an equation through all of the variables 
it contains except through its output variable. Cast in terms of the occurrence 
matrix an output set is a set of nonzero elements in the occurrence matrix 
such that one and only one element appears in each row and simultaneously 
one and only one element appears in each column. A unique output set may 
or may not exist for a system of equations, but for the purposes of initiating 
the decomposition, any output set will suffice. The encircled elements of 
Fig. 6 designate an output set for the occurrence matrix of Fig. 5. Methods 
for selecting an output set are discussed in Section III. 





Variable 




X 1 

*2 

*3 

*4 

*5 

*6 

fl 

r® 

1 

0 

0 

0 

O' 

fl 

0 

0 

0 

© 

0 

0 

Equation 

H 

0 

0 

0 

0 

o© 

0 

1 

0 

© 

1 

0 

fs 

1 

0 

0 

0 

0 

© 

fe 

.0 

® 

1 

0 

1 

0 _ 


Fig. 6. Occurrence matrix with output set. 

Once an output set has been established, the direction of information flow 
is fixed in the system of equations, and they can be represented either by a 
linear diagraph or its associated Boolean adjacency matrix. For our purposes 
it is more convenient to work with the Boolean adjacency matrix, which can 
be obtained directly from the occurrence matrix and output set as follows. 
First, assign numbers to the equations that correspond to the rows, and 
numbers to the variables that correspond to the columns of the occurrence 
matrix as in Fig. 5. Then pick an output set by the methods described in 
Section III. For the first equation, and the number of the column containing 
its output variable. The information flow transmitted by the variable desig¬ 
nated by the column number goes to all other equations that have nonzero 
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elements in the column. This flow^is identified by entering a nonzero element 
in the proper row of the first column of the adjacency matrix. Similarly for 
every other equation, the column in which the output element appears can 
be located, and the information flow from that equation to all other equations 
identified. Nonzero entries are made in the columns of the adjacency matrix 
for each equation (row) having such a flow. For example, in Fig. 6, column 1 
of the occurrence matrix contains the output element of Eq. (1), and the 
nonzero element in column 1, row 5, indicates a flow directed from Eq. (1) to 
Eq. (5). Therefore, a nonzero entry would be made in the element r 51 of the 
adjacency matrix R. The output element of Eq. (2) in Fig. 6 is in column 4, 
and a flow is indicated from Eq. (2) to Eq. (4). If this procedure is continued for 
all of the equations, the adjacency matrix of Fig. 7b is obtained. 

Variable 

*4 *3 *5 *6 *2 

0 0 0 0 1 ’ 

® o o o o 

0 ® 0 1 0 

10 0 0 0 
0 0 0 0 0 

0 1 1 0 ® 

(a) 

Equation 

fi fz f* f 5 fe* 

00001 " 

0 0 0 0 0 

0 0 0 1 0 

1 0 0 0 0 

0 0 0 0 0 

0 110 0 

(b) 

Fig. 7. Occurrence matrix (a) and associated adjacency matrix (b). 

To form the adjacency matrix in a simpler way, note that the elements of 
the first column of the adjacency matrix in Fig. 7b correspond exactly to 
the elements of the first column of the occurrence matrix in Fig. 6 if the out¬ 
put element is deleted. Similarly, the elements of column 2 of the adjacency 
matrix correspond exactly to the elements of the fourth column of the occur¬ 
rence matrix if the output element is deleted. Therefore, if the columns of the 
occurrence were permuted until all of the output elements appeared on the 
main diagonal, as in Fig. 7a, the nonzero elements of the occurrence matrix 


Equation 


’CD 

0 

0 

0 

1 

0 


Equation 


/i 

fi 

fz 

u 

/s' 

fe 


a 

'0 

0 

0 

0 

1 

0 
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would correspond exactly to those in the adjacency matrix except for the ele¬ 
ments appearing on the main diagonal. Thus, after the permutation of columns 
is completed, the adjacency matrix can be obtained by simply renumbering 
the columns so that they have the same sequence as the rows and then by de¬ 
leting the nonzero elements on the main diagonal (which in effect represent 
self-loops of information flow). 

We have shown thus far how the system of equations representing a pro¬ 
cess can be related to a linear diagraph and its associated Boolean adjacency 
matrix. In the Section IV, we show how the location of the maximal loops in 
this adjacency matrix leads to identification of the subsystems of equations 
that must be solved simultaneously. 


III. Finding an Output Set 

The concept of an output set was introduced in Section II. In this section 
we discuss how to find an output set, and how the procedure that is presented 
can help in ascertaining system determinacy. 

The problem of finding an output set can be viewed as a problem of assign¬ 
ing an output variable to each system equation in such a way that each 
equation has a single output variable and each variable is assigned to only one 
equation subject to the constraint that a variable can only be assigned to an 
equation if it appears in that equation. The problem of finding an output set 
is a special case of the assignment problem of linear programming (Cl) in 
which there is no objective function to be optimized and all feasible assign¬ 
ments are acceptable. 

The algorithm that has turned out to be the most effective in practice for 
finding an output set is that proposed by Steward (S3). Steward proved that 
application of the algorithm to a system of equations will lead to one of two 
outcomes. The first is that an output set will be found. The other is that a 
subset of equations is found that contains fewer variables than equations, in 
which case Steward has shown that no output set exists for the system of 
equations. 

Steward’s algorithm, presented below, is a logical sequence of steps to 
assign variables to equations, and is stated in terms of assigning columns to 
rows. Necessarily, some provision must be made to retain the identification of 
the variable numbers with the column numbers and the equation numbers 
with the row numbers. An assignment of a column to a row is indicated by 
placing the number of the column into the corresponding row of an n by 1 
matrix A. Initially, all of the elements of matrix A are set equal to zero. The 
algorithm involves two phases, assignment and reassignment. Assignment be¬ 
gins by assigning the first column (variable) of the occurrence matrix to the 
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first row (equation) in the column that has a nonzero element. Then the 
second column is assigned to the first row in that column that has a nonzero 
element. This procedure is continued until a column is assigned to a row 
previously identified with another column. When this occurs reassignment 
starts; the column that was previously assigned to the row is reassigned to 
another row that has a nonzero element in the column. If the row to which 
the column is reassigned also had a previous assignment, the previously 
assigned column is reassigned in the same manner. This procedure is con¬ 
tinued as long as a new column is not reassigned to any row more than once, 
in order to avoid getting into an infinite loop of reassignments. 

Each row and column in which a reassignment has been made is tagged to 
keep track of the assignments and reassignments. In the computer routine in 
Appendix A, a column is tagged by placing a 1 in the corresponding column 
of a 1 by n information matrix C and a row is tagged by placing a 1 in the 
corresponding row of an n by 1 information matrix T. 

If in the reassigment, a column is encountered that cannot be reassigned 
because it contains no nonzero elements in an untagged row of the occurrence 
matrix, it is necessary to examine all the columns previously reassigned 
(tagged columns) to see if any of them can be reassigned to an untagged row. 
If so, the reassignment continues. If not, the algorithm terminates and an 
output set cannot be found. The untagged rows correspond to a subset of 
equations that contain fewer variables than equations. 

The reassignment phase continues until a column is reassigned to a row 
that has not been previously identified with a column. Then the reassigment 
phase stops and the assignment phase takes up again. 

We now consider how Steward’s algorithm can help to ascertain whether 
or not the system of equations describing the process is determinate. It should 
be noted that if a system of equations having the same number of variables as 
equations incorporates a subset of equations that contains fewer variables 
than equations, a unique solution of the system equations is unlikely to exist. 
We have used the words “is unlikely to” rather than “does not” because 
there are some special classes of equations that specify more than one variable, 
and if such an equation is included in the system, the system may have a 
subset of equations with fewer variables than equations and still be deter¬ 
minate. For example, consider the system of Eq. (5): 

x 2 + y 4 = 0 

z^l (5) 

2z = 2 

The second and third equations together contain only one variable and are 
not independent. The system, however, has a unique solution since the first 
equation specifies that both x and y , if real, must be zero. However, these 
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special classes of equations usually do not appear in models of real physical 
process, and consequently, if a model is analyzed by Steward's algorithm and 
no output set can be found, the system of equations is most likely not deter¬ 
minate. 

In order to understand why a system of equations that includes a sub¬ 
system of equations with fewer variables than equations is not determinate, 
consider the following cases: Case A—the subsystem of equations contains 
an equation that is not independent for the remaining equations in the sub¬ 
systems; Case B—the variables for the subsystem are overspecified, that is, 
in the subsystem there are more equations than variables. At the same time, 
in the remaining equations, there are more variables than equations. Case B 
occurs because a variable was treated as a parameter (constant) in the equa¬ 
tions in which it appears, and a parameter was treated as a variable in the 
equations in which it appears. In Case A, the system equations are not con¬ 
sistent and no solution to the process model exists. In Case B, a solution 
cannot be obtained either. 

Steward’s algorithm, thus, can help locate errors in formulating a model 
of a process by identifying the set of equations that contains improperly 
specified design variables or parameters. The fortran program listed in 
Appendix A prints out the equation numbers in the subset that contains 
fewer variables than equations when an output set cannot be found. 


IV. Partitioning of the System Equations 

This section treats the partitioning of the system equations into the smallest 
irreducible subsystems, that is, the smallest groups of equations that must be 
solved simultaneously. Partitioning represents the first and easiest of the two 
phases of decomposition; tearing (which is discussed in Section VI) is more 
difficult. 

As an illustration of partitioning consider the following equations written 
in functional form 


/i(*i, *2)=0 
fi(x 2 , x 3 ) — 0 

/ 3 (*i, x 3 , x 4 ) = 0 (6) 

h Xxa. , * 5 ) = 0 

fs(x 4 , *s) = 0 


It can be seen by inspection that equations for functions / 4 and / 5 contain 
only variables x 4 and x 5 , and that it is possible to solve these two equations 
for the values of * 4 and x 5 independently of the remaining equations. Also 
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it would not be possible to solve either f A or f 5 alone since both contain two 
variables. Therefore f A and f 5 constitute a subsystem of equations which must 
be solved simultaneously. After f A and f 5 are solved and the value of x A is 
known, the remaining three equations contain only three variables whose 
values are unknown. Since each remaining equation contains two different un¬ 
specified variables, f u f 2 , and / 3 must also be solved simultaneously for the 
values of x u x 2 , and x 3 . Therefore, we can partition the system of Eqs. (6) 
into two subsystems (which must be solved simultaneously) instead of solving 
the whole system simultaneously. However, in order to solve the two sub¬ 
systems of equations independently of one another, the subsystem comprising 
f A and f 5 must be solved first to obtain a value for x A . If the subsystem com- 
prising f u f 2 , and / 3 were to be solved first, a value for x A would have to be 
assumed and the subsystem solved again when the correct value of x A was 
obtained from the subsystem of f A and f 5 . Therefore, an effective partitioning 
scheme must not only locate the subsystems of equations that must be solved 
simultaneously, but must also order the subsystems into a sequence such that 
each subsystem can be solved in its proper sequence. 

We first mention certain criteria for partitioning and tearing, and then 
describe methods of executing the partitioning. Tearing is treated in Section 
VI. 

A. Criteria for Decomposition Algorithms 

Certain desirable features are sought in and certain limitations are im¬ 
posed on any effective decomposition scheme. These are: 

(1) Its logical steps must be appropriate and consistent so that they can 
be programmed on a digital computer. 

(2) Its methods of analysis must be efficient enough so that large systems 
of equations (on the order of 1000 equations) can be analyzed in a reasonable 
amount of digital computer time. 

(3) It must partition the system into the smallest possible subsystems that 
can be solved independently in sequence. 

(4) It must indicate the precedence order in which the equations or sub¬ 
systems are to be solved. 

(5) It must choose as the output (dependent) variable of each equation a 
variable that can be solved for explicitly. 

(6) It must choose as the variables for iteration those for which the sen¬ 
sitivity of the system is appropriate and consistent with the particular solution 
technique chosen for the system. 

(7) It should order the equations within the irreducible subsystems so 
that the minimum number of variables need be iterated or specified as system 
inputs to obtain a complete solution of the system. 
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A sound decomposition strategy should be applicable to any type of 
mathematical model of a physical process. Therefore, the set of system equa¬ 
tions might include linear or nonlinear equations; algebraic, differential, 
difference, or integral equations; continuous or discrete variables with the 
following restrictions: 

(1) Each equation must be solvable for at least one of its dependent 
variables. 

(2) The set of n equations together must contain exactly n variables. 

(3) Each subset of i equations must be solvable for at least i variables 
where / = 1, 2, ... n, as discussed in Section III. 

(4) The system of equations must be determinate. Determinate here means 
that at least one solution of the set of equations exists within the physically 
allowable range of the variables. 

The best decomposition procedure is the one that fits the above criteria 
and facilitates the most economical solution of large systems of equations. 
Clearly, not all of the criteria are simultaneously completely compatible; 
hence any computer routine must necessarily represent certain compromises. 

B. How the Irreducible Sets of Equations (Those to be Solved Simul¬ 
taneously) Correspond to the Maximal Loops in the Adjacency Matrix 
and are Invariant of the Output Set 

We now demonstrate how the subsystems of equations that must be solved 
simultaneoulsy correspond to the maximal loops found in the related ad¬ 
jacency matrix. Consider the following set of two equations each containing 
common variables: 


/i(*i> *2) =0 
/zOi- *2) = 0 


Neither equation can be solved independently because both equations con¬ 
tain the same variables. In terms of information flow, regardless of the output 
set chosen, the first equation must feed information to the second equation 
and the second to the first, constituting a loop of information flow. For any 
number of equations, as long as each equation feeds information to the next 
equation in sequence, and the last equation feeds information to the first 
equation, the whole system of equations has to be solved simultaneously. 
If a set of equations comprises part of a larger set of equations, which them¬ 
selves form a larger loop of information flow, the subset must be solved to¬ 
gether with the bigger system. Thus, any set of equations in which each 
equation is included in a maximal loop of information flow must be solved 
simultaneously. 
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Previously we have termed the largest loop of information flow a maximal 
loop, and indicated that it is not tied into other loops, by definition. One 
might wonder, because the choice of an output set is not unique, whether the 
maximal loops of information flow in one adjacency matrix will differ from 
those of another adjacency matrix, i.e., one formed from a different output 
set. It is shown in the following paragraphs that the maximal loops will be 
the same and therefore any output set will suffice for accomplishing the par¬ 
titioning. 

Consider the relationship between output sets. Steward (S3) has shown 
that all of the output sets of a system of equations can be generated from one 
output set and the loops of information flow for that set. He showed that the 
only way another output set could be obtained would be to reassign the output 
variable of each equation in a loop to the equation in that loop which it feeds. 
For example, consider the following set of three equations: 


fi(x u x 3 ) == 0 

fi (*i> * 2 ) = 0 (8) 

Mx 2 ,x 3 ) = 0 

Suppose x x is assigned as the output variable of f u x 2 as the output variable 
of f 2 , and x 3 as the output variable of / 3 . With this output set, f x feeds 
information to f 2 ,/ 2 feeds information to / 3 , and/ 3 feeds information to f x 
constituting a loop of information flow. 

A new output set can be obtained by reassigning the output variables se¬ 
quentially in the loop as follows: x 3 , the output variable of/ 3 , is reassigned 
to the equation in the loop that/ 3 feeds, namely f x ; then, x x , the former out¬ 
put variable of f u is reassigned to / 2 , and finally, x 2 , the former output 
variable of f 2 , is assigned to/ 3 . In the new output set, x 3 is the output variable 
of f u x x is the output variable of / 2 , and x 2 is the output variable of / 3 . 
With this new output set,/i feeds information to f 3 ,/ 3 feeds information to 
f 2 , and/ 2 feeds information to f u forming a loop of information flow directly 
opposite to the loop based on the original output set. Steward (S3) also shows 
that the only way to proceed from one output set to another is to reassign 
the variables sequentially in a loop as described. Intuitively one can see that 
if a reassignment of an output variable is made in one equation, the former 
output variable of that equation must be reassigned to another equation, and 
so on continuing until the equation that contained the first reassigned variable 
as its output variable had assigned to it a new output variable. This reassign¬ 
ment can only result in a valid output set when the reassignment is carried out 
sequentially in a maximal or subordinate loop of information flow. 

For any maximal loop, which is by definition not connected to a larger 
loop, it is clear that the. information flow into and out of the set of equations 
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in the maximal loop is exactly the same regardless of the output set, because 
the same variables are assigned to each maximal loop for all output sets. 
Therefore, no new maximal loops can be formed upon changing the output 
set. The problem of partitioning, then, is reduced to finding all the maximal 
loops in an adjacency matrix and ordering these into the proper sequence. 

C. Methods of Partitioning 

One method of partitioning the system equations is to compute the 
maximal loops using powers of the adjacency matrix as discussed in Section 
II. Certain modifications to the methods of Section II are needed in order to 
reduce the computation time. The first modification is to obtain the product 
of the matrices using Boolean unions of rows instead of the multiplication 
technique previously demonstrated to obtain a power of an adjacency matrix. 
To show how the Boolean union of rows can replace the standard matrix 
multiplication, consider the definition of Boolean matrix multiplication, 
Eq. (2), which can be expanded to 

c u = a n . b i} u a i2 . b 2j ■ ■ • u a in . b nJ (9) 

According to the rules of Boolean operation 

0.1=0 Ou 1 = 1 

1.1 = 1 1 u 1 = 1 

1.0=0 1 u0 = 1 

Now, if a im = 0, the terms with k = m will not contribute to for j = 1,2, 
... n (i.e., all of the elements of row i in C). However, if a im = 1, every 1 in 
row m of B will appear in c {j forj = 1, 2,... n. Therefore, row i of C is just 
the Boolean union of the rows of B corresponding to the nonzero entries in 
row i of matrix A. 

This modification for Boolean matrix multiplication permits use of the 
Boolean union operation (logical or operation or logical sum) instead of 
regular multiplication and union operations. The Boolean union operation 
can be executed much faster on a digital computer. Experience has shown 
that performing the Boolean union of rows instead of the standard Boolean 
multiplication of matrices can reduce the computation time by as much as a 
factor of four. 

In order to obtain in a digital computer all of the powers of the adjacency 
matrix and then compute the reachability matrix by taking the Boolean sum 
of all of the powers, each power of the adjacency matrix and the reachability 
matrix would have to be stored in the computer memory. For large systems 
of equations an unreasonable amount of storage would be required. A second 
modification to the methods described in Section II is to drastically reduce 
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the number of matrices that must be stored by computing the powers of 
(A u I), where A is the adjacency matrix and I is the identity matrix. Note 
that the (A u I) matrix is just the adjacency matrix in which all of the elements 
on the main diagonal have been set equal to 1. Steward (S3) has shown that 
the nth power of (A u I) is equivalent to the reachability matrix. The square 
of (A u 1) is 

(A u I) 2 = [A „ (A u I)] u I , [(A ul)] = A 2 uAul (10) 

and the general relation for the nth power of (A u I) is 

(A u I)" = A” u A"” 1 u • • • u A u I (11) 

Thus, the nth power of the matrix (A u I) contains all of the nonzero elements 
of the reachability matrix plus a main diagonal of l’s. In the reachability 
matrix, only those rows that correspond to equations in a loop have nonzero 
elements on the main diagonal; however, since the criterion for equation i 
and equation j to be in the same maximal loop is that r* = r* = 1, where r* 
is an element of the reachability matrix, the extra elements on the main di¬ 
agonal of (A u I)* will only indicate that each equation is in a maximal loop 
with itself. Therefore, the maximal loops can be found in the (A u i)" matrix 
by the same procedure as discussed in Section II. Also, instead of computing 
each power of the (A u I) matrix, some computation time can be saved by 
multiplying the matrix by itself to obtain the second, fourth, eighth, etc. 
powers until the power is greater than n . Because of the properties of Boolean 
algebra, any power of the matrix greater than n will contain exactly the same 
nonzero elements as in the nth power, and therefore it is not necessary to 
compute exactly the nth power. 

With these modifications to the methods of Section II, only two matrices, 
the matrix of the current power and the one of the previous power, need be 
stored in the computer memory at one time, and the number of matrix multi¬ 
plications is reduced from n multiplications for the methods of Section II to 
log n/log 2 multiplications for the modified method. 

Actually, only one matrix need be stored if the adjacency matrix is stored 
initially and thereafter multiplied by itself. Matrix elements are replaced by 
the resulting product elements as they are computed. The product matrix 
obtained in this manner for the &th power may contain some nonzero ele¬ 
ments which correspond to paths longer than k steps instead of strictly k 
step paths, but this will not affect the final matrix obtained corresponding to 
the «th power, since these paths would eventually be identified in any case. 
All of the modifications to the methods of Section II mentioned above sim¬ 
plify the calculations needed to obtain the reachability matrix. The procedure 
for identifying the maximal loops given in Section II remains the same. 
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In addition to identifying the maximal loops, it is necessary to order them 
into a sequence such that the equations of each maximal loop feed information 
only to the equations of maximal loops appearing thereafter in sequence. One 
scheme of precedence ordering of the maximal loops is accomplished by first 
forming an adjacency matrix of loops, P, in which the rows and columns 
correspond to the maximal loops (S3). The elements of the matrix P are either 
1 or 0 according to the rule 

1 if the output variable of any equation in loop j 
Pij = appears in any equation of loop i for i ^ j. 

0 otherwise. 

The nonzero elements of a column, j\ of the matrix P indicate the loops that 
are fed information by the equations of loop j, and the nonzero elements of 
any row, j, indicate the loops that feed information to loop i. The ordering 
procedure for the irreducible sets of equatins (maximal loops) is as follows: 

(1) Choose the first column of the matrix P that contains all zeros. This 
column corresponds to a maximal loop, which is not fed information by any 
of the other maximal loops; hence the set of equations that make up the 
maximal loop can be solved independently of the remaining equations in the 
system. A maximal loop so identified is placed first in the ordering sequence 
for calculation. 

(2) The column of matrix P identified in step 1 and the row corresponding 
to the same maximal loop are removed from P to obtain a reduced matrix 
that contains the information flow among the remaining maximal loops. 

(3) Steps 1 and 2 are repeated on the reduced matrix and each maximal 
loop that is isolated from the others as it is detected is placed sequentially in 
the ordered sequence for solving the subsystems of simultaneous equations. 
The procedure is continued until all of the columns of the matrix P are 
removed. 

Several other methods of partitioning have been proposed, some of which 
take less computation time on a digital computer. 

Steward (S4) proposed one method of partitioning based on an earlier 
method of Sargent (SI), a method that involves tracing paths of information 
flow to find the maximal loops. The algorithm takes advantage of the fact 
that in many large systems of equations a significant number of the maximal 
loops contain only one equation and, in addition, in these large systems each 
equation contains only a small fraction of the total number of system variables. 
The first phase of Steward’s procedure is to remove all of the rows that con¬ 
tain no nonzero elements, and their corresponding columns, from the ad¬ 
jacency matrix to obtain a reduced matrix. These rows correspond to 
equations that are not fed information from any of the other equations in the 
system and consequently can be solved independently. Each equation is 
placed in the precedence scheme in the order in which it is detected. Next 



DECOMPOSITION OF LARGE SCALE SYSTEMS 


205 


there may be some equations that contain, besides their own output variable, 
only the output variables of equations already placed in the ordering sequence. 
These equations correspond to rows in the reduced matrix that contain no 
nonzero elements, and they can be placed successively in the ordering se¬ 
quence and the appropriate rows and columns removed from the reduced 
matrix to form an even further reduced matrix. Thereafter, rows with no non¬ 
zero elements in the reduced matrix are removed to obtain another reduced 
matrix, and so on until eventually a reduced matrix is obtained which cannot 
be further reduced. 

The second phase of Steward’s method is to trace a path of information 
flow starting with any equation and going to one of the equations that feeds 
it. Then the path is traced from this second equation to one of the equations, 
which in turn feeds it, and so on. Tracing of the path is continued until an 
equation is encountered twice. When an equation has been encountered twice, 
a path has been traced around a loop of information flow that includes all of 
the equations encountered in the path between the first and second times the 
repeated equation is encountered. 

In the adjacency matrix a path is traced by starting with a row, i, and 
tracing a path from that row to the first row, j, for which there is a nonzero 
element in the yth column of row L The path is then continued in the same 
way from row j to the first row that feeds it. When a loop is found, the set of 
equations in the loop must be solved together and, for the purposes of par¬ 
titioning, can be considered as one composite equation with the same infor¬ 
mation for the loop fed to it from the remaining equations in the system and 
the same information fed by it to the remaining equations in the system. A 
composite equation can be formed from the adjacency matrix by removing all 
of the rows and columns of the reduced adjacency matrix that correspond to 
equations in the loop, and the by adding one row, which is the Boolean union 
of all of the rows so removed, and one column, which is the Boolean union 
of all of the columns removed, to the reduced matrix. 

When a loop is found by the above procedure, it may or may not be a 
maximal loop and it may or may not contain the set of equations that should 
be solved next in sequence. If the loop is maximal and contains the set of 
equations to be solved next in the sequence, it will not be fed information by 
any of the other equations of the reduced system, and, in the reduced matrix, 
the row corresponding to the loop will contain all zero elements. Therefore, 
when a loop is found and the reduced matrix is formed, the procedure returns 
to the first phase and removes the rows without nonzero elements. When a 
row corresponding to a loop is removed from the matrix, the set of equations 
that correspond to all of the rows that were combined to form the composite 
row of the loop are placed next in the precedence order. If no rows have all 
zero elements, the procedure continues with the second phase by tracing a 
new path starting with any row of the reduced matrix and considering only 
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the loops and equations that remain in the reduced matrix. The procedure 
continues until all of the equations have been removed from the reduced 
matrix. 

To illustrate how Steward’s algorithm is executed, consider the adjacency 
matrix of Fig. 8a. The first row contains all zero elements and can be removed 
from the adjacency matrix; the equation represented by row l is placed first 
in the ordering sequence for solving the equtions. The removal of the first 
row corresponds to a cut shown by the dashed lines in the associated graph 
in Fig. 8b, and the portion of the graph below the dashed line is the graph of 
the reduced matrix obtained by removing row 1 and column 1 as in Fig. 9a. 

There are no rows with all zero elements in the reduced matrix of Fig. 9a, 
hence we proceed to the second phase of Steward’s algorithm by starting with 
the first row of the reduced matrix to trace the path 2 -> 3 -> 4 -> 2. The loop of 
information flow between vertices 2, 3 , and 4 is encircled by the dashed line 
in the graph in Fig. 9b. The rows and columns labeled 2, 3 , and 4 are next 
removed from the reduced matrix; and one row, which is the Boolean union 
of the rows labeled 2, 5, and 4 , and one column, which is the Boolean union 
of the columns labeled 2, 5, and 4, are added to the reduced matrix to obtain 
the new reduced matrix of Fig. 10a. The added row and column are labeled 
L t in Fig. 10a. 

There are still no rows with all zero elements in the reduced matrix of Fig. 
10a, so we trace the path L x ^5 -+6-> L x starting with the first row of the 
reduced matrix. The rows and columns labeled L u 5, and 6 are removed from 
the matrix, and one row and column, which are the Boolean unions of rows 
L u 5, and 6 and columns L u 5, and 6, respectively, are added to the matrix 
to obtain the reduced matrix of Fig. 11a. The added row and column are 
labeled L 2 in Fig. 1 la. The first row of the reduced matrix contains all zeros. 
Therefore the loop found is maximal, which indicates that the equations cor¬ 
responding to rows 2-6 in the adjacency matrix of Fig. 8a then must be solved 
simultaneously (and subsequently to the equation represented by the first 
row of that matrix). The row and column labeled L 2 in Fig. 1 la are removed 
leaving only row 7. The equation represented by row 7 must be solved last in 
the sequence. The final ordering sequence for solving the initial set equations 
is shown in Table I. 

Steward’s algorithm has proved to be quite efficient and easily program¬ 
med on a digital computer. The fortran program listed in Appendix A 
employs Steward’s algorithm to accomplish the partitioning. 

Billingsley (Bl) proposed an algorithm for partitioning that also involves 
tracing paths of information flow in the adjacency matrix. The algorithm 
begins, as does Steward’s algorithm, by removing all of the zero rows from the 
adjacency matrix until a reduced matrix is obtained containing no rows with 
all zero elements. Then a path is traced from equation to equation (row to 
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1 2 3 4 5 6 7 

1 [0 0 0 0 0 0 O' 

2 10 10 10 0 
20001000 

4 0 1 0 0 0 0 0 

5 0 0 0 0 0 1 0 

6 0 0 1 0 0 0 0 

7 0 0 0 0 0 1 0 

(a) 

Fig. 8 . Adjacency matrix and associated graph. 
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2 ro i o i oo“ 

5 0 0 1 0 0 0 

4 1 0 0 0 0 p 

5 0 0 0 0 1 0 

6 0 1 0 0 0 p 

7L0 0 0 0 1 0 

(a) (b) 

Fig. 9. Reduced matrix and associated graph. 


Lj 5 6 7 

L t [0 10 O' 

5 0 0 1 0 

6 10 0 0 

7 Lo 0 10 

(a) 

Fig. 10. Reduced matrix with loop included. 
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Fig. 11. Reduced matrix including maximal loop. 








208 


W. P. LEDET AND D. M. HIMMELBLAU 


TABLE I 


Final Ordering Sequence for Solving 
the Initial Set Equations 


Order 

Equation represented by rows 

1 

1 

2 

2, 3, 4 , 5, 6 y simultaneously 

3 

7 


row in the adjacency matrix) as in Steward’s algorithm, except that in the 
Billingsley algorithm the path does not terminate when an equation is en¬ 
countered a second time. When an equation is encountered a second time, it 
is not added to the information flow path. Instead the path tracing continues 
from the last equation encountered in the path that has a feed from an equa¬ 
tion not yet included in the path. A record is kept of all of the equations that 
are encountered in the path but that do not yet feed the next subsequent 
equation encountered. These equations are the last equations found in tracing 
the path around a loop. The path tracing terminates when no equation in the 
path is fed by an equation not in the path. 

Next, for each terminal equation recorded, a new path is traced starting 
with the terminal equation and going from equation to equation as before. 
Each of these paths corresponds to a set of equations in which all equations 
are fed by other equations in the path. Billingsley shows that the shortest of 
these paths corresponds to a set of equations that must be solved simultan¬ 
eously (a maximal loop), and, further, this set of equations must be solved 
next in the ordering sequence since it is not fed by any other equation cor¬ 
responding to rows of the reduced matrix. The rows and columns correspond¬ 
ing to equations in the shortest path in the reduced adjacency matrix are 
removed from the matrix and the corresponding set of equations added to the 
ordering sequence. No rows or columns are added back to the reduced matrix 
when a loop is found, as in Steward’s alogorithm, because every loop found is 
maximal. The algorithm does proceed to remove rows with all zero elements 
and the corresponding columns. It terminates when all of the rows are 
removed from the reduced adjacency matrix. 

Billingsley’s alogrithm is somewhat less efficient than Steward’s algorithm 
because in the former much longer paths must be traced in the adjacency 
matrix and each path must be traced twice. However, in small scale systems 
where the number of loops included within maximal loops is small, Billings¬ 
ley’s method is somewhat faster because of the time required to form the 
Boolean unions of rows and columns in Steward’s method. Both Steward’s 
and Billingsley’s methods are significantly faster for computation on a digital 
computer than the methods of Section II. 
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V. Identifying Disjoint Subsystems 

In formulating a model of a very large process such as a whole chemical 
plant, the possibility exists that a subset of the system equations does not 
contain any variables in common with the remaining equations in the system. 
Such a subset of equations may physically correspond to a process unit or 
group of process units that are not connected in any way to the remaining 
units in the process. If this situation occurs, the subset of equations, which is 
called a disjoint subsystem, can be solved completely independently of the 
remaining equations in the system. Identification of these disjoint subsystems 
reduces the dimensionality of the complete system to that of the largest dis¬ 
joint subsystem. 

In order to clarify what a disjoint subsystem is, consider Eqs. (12): 


7i(*i> * 3 ) = o 

fl(x 2 . X 4 ) = 0 
f 3 (x 2 ,X A ) = 0 
/*(* i) = 0 


The functions and / 4 contain only variables x t and , variables that do 
not appear in the remaining two equations, and therefore,/! and/ 4 constitute 
a disjoint subsystem (/j itself is also disjoint). The occurrence matrix of Eq. (12) 
is shown in Fig. 12. If a column permutation i 1 (23) is performed on the 
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Fig. 12. Occurrence matrices of Eq. (12). 


(c) 


i 


occurrence matrix of Fig. 12a followed by a row permutation n R (124) 9 the 
occurrence matrix of Fig. 12b is obtained, which is in block diagonal form. 
The off-diagonal blocks contain all zero elements and each block on the 
diagonal represents a disjoint subsystem. If an output set was assigned to the 
system of equations and the corresponding graph drawn, a separate graph 
would be obtained for each disjoint subsystem with no edges connecting the 
two graphs (refer to Fig. 12c). 

Of course for large systems, it is impractical to ascertain by inspection 
which row and column permutations will yield a block diagonal form or to 
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draw a graph to assist in the analysis of the system. A systematic procedure 
for finding disjoint subsystems was proposed by Himmelblau (HI) based on 
an algorithm of Netter (N2) as follows: 

(1) Select the column k of the n by n occurrence matrix that contains the 
most nonzero entries. For example, in the occurrence matrix of Fig. 12a, 
choose column 1. Columns 2 or 4 could have been chosen just as well. 

(2) Form a new j by n Boolean matrix, M (0) as follows: For each zero 
entry in column k , reproduce the corresponding row as a row in M (0) . For 
example, the second row of the occurrence matrix in Fig. 12a contains a zero 
in column k = 1 and therefore the element = 0, element m ( $ = 1, 
m? 3 = 0, and m° lAr = 1 comprise the first row of M (0) . The second row of M (0) 
would be exactly like the third row of that occurrence matrix. This process is 
continued until all of the rows with zero entries in column k have been in¬ 
cluded as rows of M (0) . A final row is added to M (0) , whch is the Boolean 
union of all of the rows in the occurrence matrix which contain nonzero 
entries in column k. For example, rows 1 amd 4 of the occurrence matrix 
contain nonzero elements in column 1 so that the elements of the last row of 
M (0) are m 31 = 1, m 32 — 0, m 33 = 1, m 34 = 0. Figure 13a illustrates M (0) . 
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Fig. 13. M (0) and M (1) matrices of occurrence matrix of Fig. 12. 

(3) A new matrix M (1) is formed from M (0) in the same manner that M (0) 
was formed from the occurrence matrix. The column of M (0) containing the 
most nonzero elements is identified, and M (1) is made up of rows identical 
to each row of M (0) , which contains a zero in column k and one final row, 
which is the Boolean union of the remaining rows in M (0) . Figure 13b illus¬ 
trates M (1) . A record is kept of which rows of the original occurrence matrix 
have been combined to form each row of M (0) , M (1) , and so on. 

(4) New matrices M (2) , M (3) , etc. are formed until the matrix M ( " } is ob¬ 
tained, which contains exactly one nonzero element in each column. Each row 
of the matrix M (o) corresponds to a subset of equations in the original system. 
Since each column of M (n) contains only one nonzero element, each variable 
of the system appears in only one subset of equations represented by the row 
in which the nonzero element appears. Therefore, the subset of equations 
represented by a row of M (rt) is a disjoint subsystem. For example, the matrix 
M (1) illustrated in Fig. 13b has only one nonzero element in each column. 
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Since the first row of M (1) is made of rows 1 and 4 of the original occurrence 
matrix,/! and/ 4 constitute a disjoint subsystem;/ 2 and/ 3 comprise the other 
disjoint subsystem. 

The algorithm is easily programmed on a digital computer and is quite 
efficient in splitting large systems into their separate parts. 


VI. Tearing 

Once the complete system of equations has been partitioned into the 
irreducible subsystems of simultaneous equations, it is desirable to decompose 
further these irreducible blocks of equations so that their solution can be 
simplified. The decomposition of the irreducible subsystems is called tearing. 
In the remainder of this section the subsystems of irreducible equations found 
by partitioning will be referred to as blocks to distinguish them from the 
smaller subsystems of simultaneous equations obtained within a block after 
the tearing is accomplished. 

A. The Concept of Tearing 

All of the methods presented in this section are for tearing one block at a 
time. Specifically, the concept of tearing a variable consists of removing it 
from considerations as a variable in one or more of the equations in which it 
appears. More than one variable can be torn simultaneously. Tearing of a 
variable is effected by assuming its value as determined, say, by physical 
principles or an educated guess, followed by reordering of the equations in a 
block. Because the value of the variable is assumed, it must be checked at 
some stage in the sequence of calculation for the block, and because the sub¬ 
sequent value rarely agrees with the assumed value, the torn variable becomes 
an iterated variable. 

In tearing, the objective is to wind up with less computation time required 
to solve the torn system compared with the time required to solve the entire 
block of equations simultaneously. However, the criteria for evaluating the 
effectiveness of the tearing are by no means so well defined as those for par¬ 
titioning, where the objective is clearly to obtain the smallest possible sub¬ 
systems of irreducible equations. There is no general method for determining 
the time needed to effect a solution of a set of equations; it is necessary to 
consider the particular equations involved. Any feasible method of tearing, 
then, must be based on criteria that are related to the solution time. Some of 
the more obvious criteria are: 

(1) to tear a block so that the number of equations in each subsystem of 
resulting simultaneous equations in the torn block is mininimized; or 
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(2) to tear the block of equations so that the minimum number of sub¬ 
systems of simultaneous equations remains in the torn block; or 

(3) to tear a subsystem so that the product of the number of subsystems 
and the number of equations per subsystem in the torn block is minimized. 

In order to illustrate tearing, consider the block of Eqs. (13): 


AOi, *3) = o 

/ 2 (xi,x 2 ,x 3 ) = 0 (13) 

/ 3 (x 2 , x 3 ) = 0 


A tear can be made by removing x 3 from f x . With the value of x 3 assumed, 
f x can be solved for x v Then f 2 and/ 3 could be solved together for x 2 and x 3 . 
Finally, the calculated value of x 3 could be compared to the assumed value, 
an iterative sequence of calculations of the values of x x , x 2 , and x 3 executed 
until the roots of f u f 2 , and f 2 are obtained to the desired accuracy. Another 
equally valid tear would be to remove x 3 as a variable in f x and f 2 by assuming 
a value for x 3 . Then x t could be calculated from x 2 could be calculated 
from f 2 , and x 3 calculated from / 3 . Again the values of the variables can be 
iterated until a solution is obtained. Whether a variable is torn from one or 
more than one equation in the block depends upon the particular criterion 
used forjudging the effectiveness of the tear. In general, if a variable is torn 
from only one equation and the value of the output variable calculated in 
that equation is relatively insensitive to the value assumed for the torn vari¬ 
able, only a small number of iterations will be needed to obtain a solution. 
However, when a variable is torn from only one equation, the total number of 
tears and therefore the total number of variables that are to be iterated, will 
be more than if the variable were torn from several of the equations in which 
it appears. 

We now discuss two methods of tearing the irreducible blocks. 

B. Tearing Methods for Large Scale Systems of Equations 

Steward (S3) proposed an algorithm based on tearing a variable from only 
one equation at a time and evaluating each tear on the basis of the size of the 
resulting subsystems of simultaneous equations in the torn system and 
numerical considerations of the particular equations. Each variable is torn 
successively from each equation in which it appears and the effectiveness of 
the tear evaluated. 

The algorithm is executed on the adjacency matrix of a block. In order to 
determine how many subsystems of simultaneous equations will remain after 
a tear, one must first enumerate all of the loops of information flow in the block 
and record which equations are included in each loop. The loops are found 
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by tracing a path of information flow starting with any equation and going 
from that equation to another equation that feeds it information, i.e., opposite 
to the information flow, and then to an equation that feeds the second equa¬ 
tion, and so on exactly as in Steward’s algorithm for partitioning (see Section 
IV). When the path encounters an equation twice, a loop is found that includes 
all of the equations encountered in the path between the first and second 
encounters of the repeated equation. 

For example, in the adjacency matrix of Fig. 14 we can start with the first 
row and trace a path fromto f 5 because there is a nonzero element in row 1 
and column 5. Then the path is traced from f 5 back to f x because of the non¬ 
zero element in row 5, column 7, yielding the loop /1-/5-/1. After this loop 
has been found, each path traced from the vertices in this loop will yeild a 
loop. We can return to the last equation found in the loop, f 5 , and trace a 
path from f 5 to another equation that feeds it,/ 3 . The path is then continued 
from / 3 to the first equation that feeds it, f 2 , and from f 2 back to / 3 . Thus 
the loop / 3 -/ 2 -/ 3 has been found. We return to the last equation found, f 2 , 
and see that no other equation feeds it, in which case we must return to the 
equation found just previous to f 2 , namely/ 3 . Now/ 3 is fed by / 4 and a new 
path can be traced to obtain a loop fy-fi-fs-fs • Steward continues this pro¬ 
cedure until all of the feeds to each equation have been exhausted, but it is 
not obvious when this situation occurs except if a “tree” is drawn of the 
paths. For a large block, drawing a tree may not prove to be especially 
feasible. 

Figure 15 is a tree of the paths traced from the adjacency matrix of Fig. 14. 
The edges drawn from f x to / 5 to /j comprise the path traced to obtain the 
f\~f l loop, and the edges from f 5 to / 3 to f 2 to / 3 correspond to the path 
traced to find the / 3 -/ 2 -/ 3 loop above. Each equation in the tree with a line 
drawn under it is the terminal of a path indicating a loop, and the capital 
letter identifies the loop. In terms of the tree, Steward’s procedure for obtain¬ 
ing the loops is to trace a path starting with any equation until a loop is found, 
and then start from the last equation in the path upstream that is not repeated 
to trace another path. If an equation does not have another feed, proceed to 
the next equation upstream in the path and trace a new path from it. Each 
edge that leaves an equation and the equations and edges following it in the 
tree comprise a branch. All of the loops have been found when no new path 
can be traced from any equation in the tree. This method of finding loops 
does not lend itself to programming on a digital computer very well, but 
Steward (S3) has proposed another method that is more systematic but too 
complex to be discussed here. All the loops for small blocks can be found 
quite readily by forming a tree. 

If a variable is torn from an equation, the torn block will retain all of the 
same information flow as the original block except that there will no longer 
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Fig. 15. Tree of paths traced in adjacency matrix of Fig. 14. 


be any information flow from the equation for which the torn variable is the 
output variable to the equation from which the variable is torn. Therefore, 
the torn block has all of the same loops as the original block except for those 
that included the path of information flow through the torn element. 

In the adjacency matrix a tear is accomplished by removing the nonzero 
element in the column corresponding to the equation in which the variable is 
the output and the row corresponding to the equation from which the variable 
is torn. For example, in Fig. 14 removal of the nonzero element of row 1 and 
column 5 corresponds to tearing the output variable of f 5 from f { . When this 
tear is made there is no longer any information flow directly from f 5 to f x in 
the torn block, and loop A (/ 1? / 5 ) is broken; only loops B , C, and D are 
retained in the torn block. If the element of row 3 and column 4 is torn instead 
of the element of the first row and fifth column, loop D is broken, and only 
loops A , B , and C remain in this torn system. 

All of the elements of the adjacency matrix can be torn in turn and the 
effectiveness of the tearing of each element evaluated by Steward’s criteria. 
In addition, other tears can be made that correspond to tearing a variable 
from the equation in which it is the output variable. In order to evaluate these 
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remaining tears, a new output set and adjacency matrix must be found that 
includes none of the elements of the original output set. The former output 
elements are located in the adjacency matrix and torn. 

Steward’s method for choosing the tears breaks the most loops and there¬ 
fore minimizes the number of loops remaining in the torn system. It is easily 
seen that if two loops contain two or more equations in common and in the 
same sequence, the path of information flow between these equations is part 
of both loops, and if the path is broken by a tear, both loops will be broken, 
For example, the path from f 5 to f x is contained both in loops A and D of 
Fig. 15. If the element in the fifth row and first column of the adjacency 
matrix is removed both loops A and D are broken. 

In order to find those tears that will break more than one loop, one must 
find the paths of information flow that are included as part of more than one 
loop. This can be done by first forming a new occurrence matrix in which the 
rows correspond to the loops and the columns to the equations. A nonzero 
entry is made in a row, /, and column, j if equation j is included in loop i. 
Figure 16a shows the occurrence matrix of the equations involved in the loops 
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Fig. 16. Occurrence matrix of equations in loops. 


of the adjacency matrix in Fig. 14. The new matrix contains the informa¬ 
tion concerning the occurrence of the respective equations in the loops. 

If an equation appears in more than one loop, one tear can break all of 
the loops in which the equation appears if in each of these loops the equation 
included in both loops has exactly the same equation feeding it in the loop. 
For example, column 2 of the occurrence matrix in Fig. 16a contains nonzero 
elements in rows 2 and 4 , Upon inspection of the corresponding loops, B 
and Z), in the tree of Fig. 15 it is seen that f 2 is fed by / 3 in both loops (/ 3 
appears immediately after f 2 in paths involved in loops B and D). Therefore, 
loops B and D can be broken by tearing the output variable of f 3 from the 
equation f 2 . Now instead of putting l’s as entries in row i and column j of 
the occurrence matrix to indicate that equation j appears in loop i, we insert 
instead the number of the equation that feeds equation j in loop i as in Fig. 
16b. Then if the same nonzero number appears more than once in the same 
column, all of the loops represented by the rows in which that number appears 
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can be broken by tearing the output variable of the equation corresponding 
to the nonzero entry in the column from the equation represented by the 
column. For example, the two 3’s in column 2 of the occurrence matrix of 
Fig. 16b indicate that loops B and D can be broken by tearing the output 
variable of Eq. (3) from Eq. (2). 

Steward’s algorithm is simple and very well suited to hand calculations 
for smaller systems. However, the computer storage and time requirements 
needed to tear large systems are prohibitive. 

Ledet (LI) proposed a somewhat different algorithm for tearing based on 
tearing a variable from all the equations except one in a block. He evaluated 
the effectiveness of the tears in terms of finding the minimum number of tears 
to reduce the torn block to subsystems of single equations. In this method no 
initial output set in a block is chosen, and the tearing and assignment of an 
output set are carried out simultaneously. For example, consider the block of 
Eqs. (14), which must be solved simultaneously. We could tear x 3 from 
equations f x and 

/i(*i> *3)=0 

f 2 (x 1 ,x 2 ,x i ) = 0 (14) 

f 3 (xu x 2 ,x 3 ) = 0 

f 2 and assign x x as the output variable of f u x 2 as the output variable of f 2 , 
and x 3 as the output variable of f 3 . The torn system could be solved by (1) 
assuming a value of x 3 ; (2) computing the value of x x in f x ; (3) substituting 
the value of x x into f 2 and computing x 2 ; and (4) finally computing a value 
for x 3 in/ 3 . Then the assumed and calculated values of x 3 could be compared, 
and the value of x 3 adjusted by an iterative procedure until the assumed and 
calculated values coincided. 

Another equally valid tear would be to tear x 2 from f 2 and tear x 3 from 
the equations f 2 and f 3 , and assign x 3 as the output variable of f u x x as the 
output variable of f 2 , and x 2 as the output variable of f 2 . Then the system 
could be solved by (1) assuming values for x 2 and x 3 ; (2) computing the value 
of x x in equation f 2 \ (3) computing the value of x 3 in f x ; and (4) computing 
the value of x 2 in / 3 . The values of x 2 and x 3 could then be established by 
iteration. In both of the suggested tears, the order in which the equations are 
to be solved for their output variables is determined by the information flow 
between equations in the torn system. In both cases the torn system results 
in equations that do not have to be solved simultaneously. But only one vari¬ 
able is torn in the first case as compared with two in the second, hence the 
first tear is a better tear according to the criteria of evaluation in Ledet’s 
algorithm. 

In terms of the occurrence matrix, the best tear is obtained when the rows 
and columns have been reordered so that each element on the main diagonal 
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is nonzero (the output set of elements lies on the main diagonal) and the 
number of columns that contain nonzero elements above the main diagonal 
is minimized. The columns with nonzero elements above the main diagonal 
represent the torn variables. For example, Fig. 17 illustrates the occurrence 
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Fig. 17. Occurrence matrix with best tear. Torn variables are encircled. 


matrix of the set of Eqs. (14) reordered to correspond to the tearing of x 3 
from f x and f 2 . The encircled elements are the torn elements. 

Ledet’s algorithm involves two phases. In the first phase an ordering of 
the rows and columns of the occurrence matrix takes place according to cer¬ 
tain optimality criteria. The second phase involves reordering the occurrence 
matrix to reduce the number of torn variables. The first phase carries out an 
initial tearing, and the second phase .improves the results of the first phase. 
However, instead of tearing individual variables as in Steward’s algorithm, 
Ledet’s method systematically reorders the occurrence matrix as described 
below. 

The optimality criterion given above does not specifically help to reach a 
decision about which variable to tear. A reordered occurrence matrix is formed 
one row at a time from the rows of the original occurrence matrix according 
to the three optimality criteria given below. As eacn row is placed in the re¬ 
ordered occurrence matrix, it is assigned an output variable (column), the 
assignment being subject to the restriction that a given column cannot be 
assigned to more than one row. Because a row may have nonzero elements 
not assigned as the output variable for that row or any other row, the vari¬ 
ables (columns) corresponding to these nonzero elements must be considered 
as potential iterates. They will become actual iterates if the corresponding 
column is assigned as an output column for some subsequent row. 

The optimal way to choose the rows to form the reordered occurrence 
matrix is as follows: 

(1) Select first the row that has the minimum number of potential iterates. 

(2) If two rows are equivalent under criterion 1, select the row that has a 
nonzero element in the column for which the total (vertically) of nonzero 
elements is the greatest. 

(3) When a row, j, is chosen that introduces one or more potential 
iterates, the next rows are chosen according to the following conditions: 
(a) Each row contains one and only one nonzero element in a column not yet 
chosen as an output column (variable), or column which is not yet a potential 
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iterate, (b) Each row contains one and only one nonzero element in the output 
column or potential iterate column of all the previous rows up to and in¬ 
cluding row L 

The reason for selection of the rows by criterion 3 is that ail of the rows in the 
set can be reordered simultaneously to permit any of the output and potential 
iterate columns to become actual interates in the solution of the related 
equations. 

An effective set of iterates can be chosen as follows from the output vari¬ 
ables (columns) and potential iterate columns identified by criterion 3 above. 
Each time a set of rows is selected by criterion 3, a consolidated column is 
formed of the Boolean sum of all the columns corresponding to the output 
elements of the set plus one potential iterate column. The consolidated column 
is substituted for the column corresponding to the potential iterate in the 
original occurrence matrix. When a row j of the original matrix is placed in 
the reordered occurrence matrix, a row for which the consolidated column be¬ 
comes the output variable, the proper output variable (column) for that row 
must be determined. It can be identified by searching the columns comprising 
the consolidated column and selecting any one that has a nonzero element 
in row j. This column corresponds to a variable that will become an iterate 
in solving the system equations. 

In choosing rows according to the optimality criteria, certain feasibility 
conditions must also be met: 

(1) Certain variables (columns) may be prohibited from being selected as 
iterates because, for example, the sensitivity of that equation (row) for the 
particular variable is too great. Those variables that the user wants to omit 
as iterates can be specified in the computer program in Appendix A. 

(2) Certain variables (columns) may be prohibited from being selected as 
the output variable for a particular equation (row) because, for example, it is 
not feasible to solve the equation for that variable. Variables in this category 
can also be specified in the computer program in Appendix A. 

(3) As each row is assigned an output column there must exist at least one 
possible assignment of the columns that have not yet been assigned as out¬ 
puts to the rows not yet chosen. This condition can be determined by applying 
the methods of Section III to the rows and columns not yet placed in the re¬ 
ordered occurrence matrix. 

The second phase of Ledet’s algorithm is carried out on the reordered 
occurrence matrix resulting from the first phase of the algorithm described 
immediately above. Because the second phase is quite complex, it will not be 
discussed here, but the details can be found in the original reference. Ledet 
shows that in most cases the first phase of the algorithm will obtain the mini¬ 
mum number of tears, or very nearly the minimum number of tears, and that 
the extra effort of executing the second phase is seldom justified. 
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C. Tearing of a System of Process Units 

To this point we have viewed the mathematical representation of a process 
as just a system of equations and ignored the physical grouping of equations 
by process units or pieces of equipment. In many instances, the designer or 
analyst wants to treat each block on his process flow sheet as a single sub¬ 
system to and from which information flows by many routes. For example, 
he may have to use a computer subroutine or a nomograph to relate the unit 
output(s) to the unit input(s) for a single unit operation such as evaporation 
or heat exchange. Under these circumstances, each unit corresponds to a node 
in a graph and there would be several interconnections between pairs of 
nodes that correspond to material flows. In an adjacency matrix the rows and 
columns that correspond to individual units and not to individual equations 
as in the previous discussion. 

The problem to be considered now is how to tear effectively a system of such 
units, units interconnected by material (and probably energy) flows. We as¬ 
sume that the input-output relationships are known for each unit, and that 
outputs must be calculated from the inputs. Each physical flow corresponds 
to several variables, and the criterion for tearing will be to minimize the 
number of variables that must be assumed to solve the torn system, i.e., to 
have the minimum number of variables associated with the total of the torn 
streams. 

The complete process can be represented by a graph in which the vertices 
represent the process units and the edges represent the material streams. The 
edges are directed from one unit to another according to the direction in 
which material flows. Associated with each edge is a number that corresponds 
to the number of variables (concentrations, temperature, pressure, flow rates, 
etc.) whose values must be known to completely specify the stream. For each 
type of process unit (heat exchanger, junction, pump, reactor, etc.), there is 
a given relationship between the variables of the output streams from a unit 
and the variables of the input streams to the unit if the design parameters for 
that unit are given. For example, suppose the process consisted of a reactor 
followed by a distillation column, and the bottoms stream of the distillation 
column was recycled to the feed of the reactor. The process could be repre¬ 
sented by the graph of Fig. 18, in which vertex 7 is a source of feed for the 



Fig. 18. Graph of reactor process. 
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reactor, vertex 2 is the junction of the recycle stream and the feed stream, 
vertex 3 is the reactor, vertex 4 is the distillation column, and vertex 5 is a 
product storage tank. The lower case letters—a, b, c, d, e—represent the 
numbers of variables associated with each stream. 

Now if all the variables of the feed stream and the design parameters of 
each unit are given, the variables of each stream could be calculated by 
assuming the values of the variables in the recycle stream, calculating the 
variables of the output stream of the junction, calculating the variables of 
the output stream of the reactor, and finally calculating the variables of the 
output streams of the distillation column. The calculated values of the vari¬ 
ables of the recycle stream could then be compared to the assumed values, 
and by iteration the proper values of all of the variables determined. In this 
example, we have torn the lone recycle stream to arrange a precedence order 
for the solution, and other equally effective tears could be determined by 
inspection. In more complicated processes the particular streams to tear are 
not so obvious and some means of optimal tearing is needed. 

A process can also be represented by its adjacency matrix, which can be 
formed without first drawing the graph, by just assigning each unit a number 
and placing a nonzero entry in column j and row i if there is a stream directed 
from unit j to unit L Once the adjacency matrix is formed, it can be partitioned 
into blocks of units that must be solved simultaneously exactly as described 
in Section IV. It is the tearing of these blocks that is of interest here. In both 
of the methods of tearing discussed here the objective is to tear a block so 
that the minimum number of variables will have to be assumed in solving the 
process equations involved in the block. 

Sargent (SI) proposed a method of tearing in which all of the possible 
orderings of units in a block are considered, and dynamic programming is 
used to find the optimum order. The objective of the algirithm is to order the 
units into a sequence such that the minimum number of variables are associated 
with the output streams of units that feed preceeding units in the sequence. 
The objective is equivalent to ordering the units so that the maximum number 
of variables are associated with the feed forward streams. 

In general, the algorithm is a sytematic procedure for building sequences 
of units by adding one unit at a time and evaluating each sequence after it is 
built in terms of the number of variables associated with all of the feed for¬ 
ward streams. Each sequence corresponds to an ordering of the units in a 
sequence opposite to that in which they are solved. Once all of the possible 
sequences of every unit in the block have been obtained, the best ordering is 
chosen according to which sequence has the maximum number of variables 
associated with feed forward steams. 

Sargent also proved the following theorem: “If a sequence of k units is 
in optimum order, then the subsequence of (k - 1) units obtained by removing 
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the first unit must also be in optimum order.” Sargent’s algorithm used this 
principle to avoid storing all of the permutations of orderings of a set of 
units by retaining instead the optimum permutation. 

The algorithm begins by choosing one unit, i, of the block and forming a 
subgroup comprised of unit i and one unit that feeds it material. The number 
of variables associated with the input stream to the first unit is recorded in 
terms of the number of feed forward variables associated with the subgroup. 
Then other subgroups are formed, in the same manner, with unit i and each 
unit that feeds it. Next every other unit in the block is chosen in turn and sub¬ 
groups formed in a similar manner. If a subgroup is formed that is just a per¬ 
mutation of the previously formed subgroup, the subgroup that contains the 
maximum number of variables associated with feed forward streams is re¬ 
tained and the other subgroup discarded. In this manner all of the subgroups 
of two units are formed. Next the subgroups of three units are formed by 
adding in turn each unit that feeds one of the pair in the group of two units. 
Again, if any subgroup is obtained which is a permutation of a previous 
subgroup, only the one that has the maximum number of variables associated 
with feed forward streams is retained. Subgroups of four, five, six, etc. units 
are formed in the same manner until each subgroup contains all of the units 
in the block. The optimum sequence is the one that has the maximum number 
of variables associated with feed forward streams. 

Sargent’s algorithm is easily programmed on a digital computer especially 
if a list processing language is used; however, the storage requirements are 
quite large. Sargent indicates that 1144 words of storage are needed to process 
a block of eight units, and that the storage requirements approximately 
double for each additional unit. Thus 16 units would take roughly 250,000 
words of storage if an extended extrapolation is made. 

Lee (L2) proposed a different algorithm for tearing, which also obtains the 
tears for the minimum number of variables associated with the torn streams. 
The procedure is completely different from that of Sargent’s in that the loops 
of material flow are determined first and tears made so that all the loops are 
broken. The algorithm requires a previous knowledge of the loops of material 
flow and which units and streams are included in each loop. The loops can 
be found by forming the adjacency matrix for the process, and determining 
the loops in the matrix by Steward’s method as described in Section IV. 

Lee’s algorithm makes use of an occurrence matrix in which the rows cor¬ 
respond to the loops and the columns correspond to the streams. A nonzero 
entry is placed in column j and row i if stream j is included in loop i. The first 
step of the algorithm is to obtain a set of independent columns. The inde¬ 
pendent columns are obtained by removing a column, k , from the occurrence 
matrix if (1) there is another column, j, which corresponds to a stream with 
less variables than the stream represented by column k\ and (2) column j 
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contains nonzero entries in all of the rows in which column k also contains 
nonzero entries. Columns are removed according to these criteria until no 
more such columns exist in the matrix. The remaining columns then are in¬ 
dependent. The columns removed correspond to the poorest possible tears. 
Tearing one of the streams corresponding to an independent column breaks 
at least all of the same loops as the removed column and a smaller number of 
variables would be assumed in solving the process equations. 

In the residual occurrence matrix comprised solely of independent 
columns, if a row contains only one nonzero element, the stream correspond¬ 
ing to the column in which the nonzero element appears is the only stream 
that can be torn to break the loop corresponding to that row. The next step 
of the algorithm is (1) to remove the columns in which a lone nonzero element 
in a row appears and (2) to remove all of the rows that have nonzero elements 
in that column. Each column removed in this step corresponds to tearing the 
stream represented by the column, and all of the rows removed represent 
loops that are broken by the tear. 

After all of the rows that contain only one nonzero element are removed 
from the matrix, the columns that are strictly contained in a set are removed. 
A column, k , is strictly contained in a set of columns if (1) at least one column 
in the set contains a nonzero element in each row in which column k also has 
nonzero entries, and if (2) together, the columns of the set have less variables 
associated with the streams they represent than does the stream represented 
by column k. The columns representing streams with the largest number of 
variables that are strictly contained in the set are removed first, after which 
the reduced matrix is checked for rows with only one nonzero element. 
Whenever a row is obtained with only one nonzero element, it is removed and 
a new tear is performed as before. 

Once all of the rows have been removed from the reduced occurrence 
matrix, the tears corresponding to the minimum number of variables that 
must be assumed in the solution of the process material and energy balances 
have been found. 

Lee does not give a systematic method of ordering the units after tearing 
except by inspection of the graph with the torn streams removed. The method 
is satisfactory for tearing small systems by inspection but cannot be pro¬ 
grammed without supplementation by a systematic method of ordering. 


VII. Comparison of Precedence Ordering Techniques 

Various methods of carrying out the different phases of decomposition 
are compared in this section. Only one method of finding an output set has 
been reported in the literature, namely that of Steward (see Section III). 
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Steward’s method is quite efficient and very easily programmed for a digital 
computer. Subroutine outset in the fortran program listed in Appendix A 
executes Steward’s algorithm for finding the output set. Table II lists the 
execution times on a Control Data Corporation 6600 computer to obtain the 
output sets of several problems of increasing scale. No attempt has been made 
to relate the computation times versus the number of equations by a function 
because the times will not only vary with the number of equations but also 
with the number and location of the nonzero elements in the occurrence 
matrix. However, since the primary operation performed in the algorithm is 
searching of the rows and columns of the occurrence matrix, since the number 
of searches performed is proportional to the number of equations in the sys¬ 
tem, and since the length of each search is limited by the number of equations, 
the times should be roughly proportional to the square of the number of 
equations. 

The three methods of partitioning discussed in Section IV were program¬ 
med for a digital computer, and Table III compares the execution times on 
the CDC 6600 computer to partition the same problems as in Table II. It 
was found that Steward’s algorithm was the most efficient for large systems 
and Billingsley’s method the most efficient for small systems. From Table III 
it can be seen that Steward’s algorithm was executed in less time than Billings¬ 
ley’s only for two examples; however, those examples contained the greatest 
number of loops of information flow within the maximal loops. It can be 
expected that in most large scale problems the number of loops within maxi¬ 
mal loops will be large and therefore Steward’s algorithm will usually be more 
efficient. Subroutine steward in the fortran program listed in Appendix A 
executes Steward’s algorithm. 

Himmelblau’s algorithm discussed in Section V for finding the disjoint 
subsystems is easily programmed on a digital computer and quite efficient. 
Because the algorithm for finding the disjoint subsystems was so straight¬ 
forward and efficient, no other methods have been considered here. A com¬ 
puter program to find the disjoint subsystems in a process was not included 
in Appendix A because the analysis on disjoint subsystems should be carried 
out prior to the more detailed decomposition of partitioning and tearing. 
Table IV lists execution times for several problems by a program prepared by 
the authors and available from them. 

The two methods for tearing systems of equations described in Section 
VI,B could not be compared directly because no computer program was 
available to execute Steward’s algorithm. However, some qualitative observa¬ 
tions can be made with respect to the computer storage requirements and 
computation time, which would be required for executing Steward’s algorithm 
on a digital computer. First of all, the storage requirement for storing the 
numbers of all of the equations that appear in each loop would be quite large. 
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TABLE II 

Time to Obtain an Output Set by 
Steward’s Algorithm 


Number of equations 

Time (sec) 

12 

0.017 

24 

0.088 

59 

0.423 

71 

0.588 

95 

1.116 

112 

1.920 

126 

1.272 

134 

1.846 

143 

1.491 

148 

1.616 


TABLE III 


Computation Times por Partitioning 


Number of equations Steward 

Time (sec) 
Billingsley 

Boolean Powers 

12 

0.029 

0.022 

0.059 

17 

0.059 

0.083 

0.116 

18 

0.087 

0.055 

0.162 

24 

0.181 

0.103 

0.231 

42 

0.694 

0.315 

0.745 

59 

1.540 

0.649 

1.457 

71 

2.533 

1.180 

2.273 

95 

5.823 

2.537 

4.163 

113 

9.555 

3.455 

5.798 

126 

11.959 

13.093 

36.530 


TABLE IV 

Computation Times to Find Disjoint Subsystems 


Number of equations 

Time (sec) 

42 

0.231 

59 

0.673 

71 

1.210 

95 

2.598 

113 

4.009 
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Second, the adjacency matrix would require n 2 words of storage where n is the 
number of equations in the system. In finding the loops by either the method 
described here or by the other method proposed by Steward (S3), all of the 
loops are found more than once, which tends to reduce the efficiency of the 
procedure. Finally, since no exact criteria was given by Steward for evaluating 
the effectiveness of each tear, all possible tears must be performed and the 
best tear chosen by inspection of all the tears. Steward’s algorithm, however, 
is simpler than Ledet’s algorithm, and therefore better suited for decompo¬ 
sition of small systems by hand. 

In Ledet’s algorithm, if both the first and second phases are carried out to 
obtain the absolute minimum number of tears, the computer storage required 
is quite large and the amount of computation time will be unreasonable. 
However, if only the first phase is executed, the number of iterates will be 
drastically reduced and in many cases the minimum number of tears will be 
found anyway. The computer storage required to perform the tearing with 
only the first phase is quite reasonable and the computer execution time is 
short compared to the time required to solve the system. To be specific, the 
storage required for the computer program listed in Appendix A is approxi¬ 
mately n 2 + 11,000 words, where n is the number of equations in the system. 
Table V lists the execution times on the CDC 6600 computer to accomplish 
the tearing for the same problems as in Table II. The unusual distribution of 
times was caused by the quite different structure of information flow in each 
problem. 

Ledet’s algorithm has the advantage that a realistic tearing can be accom¬ 
plished because the output variables chosen by the algorithm can be controlled 
to correspond with the accustomed information flow for the solution of each 


TABLE V 

Execution Times for Tearing by the First 
Phase of Ledet’s Algorithm 


Number of equations 

Time (sec) 

12 

0.026 

24 

0.351 

59 

0.894 

71 

0.893 

95 

2.183 

112 

30.074 

126 

31.455 

139 

88.593 

143 

5.800 

148 

3.675 
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equation, and the choice of variables as iterates can be limited to those vari¬ 
ables for which it has been determined that the solution technique will con¬ 
verge. 

In tearing of the material and energy flows between units as described in 
Section VI,C, the method proposed by Sargent is more effective for digital 
computation. Unfortunately, though, the storage requirements are high, and 
because of the combinatorial approach in the ordering of units, the time re¬ 
quired for tearing would be substantial. However, the advantages of Sargent’s 
algorithm are (1) that it obtains the ordering with absolute minimum number 
of variables associated with the recycle streams, and (2) that the order of 
solving the units is designated. It should also be pointed out that in most 
processes the number of units in a process will be considerably less than the 
number of equations that mathematically describe the units, so that Sargent’s 
algorithm may prove adequate. 

In Lee’s algorithm the amount of storage required for accomplishing the 
tearing would be small compared to Sargent’s, but the storage required 
initially to find the loops would be almost as large as that required in Sargent’s 
algorithm. 

VIII. Two Examples 

To illustrate a complete decomposition for a process represented by a 
large number of equations is not feasible in this review because of the practical 
difficulties of printing the resulting matrices. Also, the problem statements 
would probably be longer than the review itself. Consequently, decomposition 
of two modest-sized processes is illustrated in this section, one a process of 
sufficient scale to be quite impracticable to decompose by inspection. The 
first example model has been taken from “Analog Simulation of the Hanford 
N-Reactor Plant” (S5), a documentation readily available from the Clearing¬ 
house for Technical and Scientific Information as well as AEC depositories. 

An analog simulation model of the Hanford N-reactor and its principal 
coolant systems had been used to study the dynamic characteristics of the 
plant; to determine the effects of plant, operator, and control system mal¬ 
functions; and to study various operational procedures and proposed design 
changes. Figure 19 shows the process flow diagram. The basic subsystems 
modeled were the reactor, a pump in the primary loop, and a steam generator 
and a dump condenser in the secondary loop. Each pump, steam generator, 
or dump condenser could be augmented to represent any desired number of 
parallel units by repetition of the model. 

The size and complexity of the N-reactor plant and the limited amount of 
computing equipment that was available necessitated a judicious use of sim¬ 
plifying assumptions. For instance, primary coolant temperature transport 
lags were lumped into two groups, one each for the hot and cold loop legs; 
thermodynamic effects in the secondary system condensate headers and surge 
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tanks were neglected; and the reactor was considered to act as a lumped sub¬ 
system. For many types of studies, these assumptions resulted in only second- 
order errors. 

To provide this first example, a system of 112 equations were taken from 
the original reference comprising the following subsystems: 

Primary Loop: 

(1) Reactor (power level model with scram and power setback trip 
circuits). 

(2) Primary coolant hydraulic model (one primary pump). 

(3) Primary coolant thermal model with lumped transport and mixing 
lags (the transport lag was varied as a function of the flow rate). 

(4) Steam generator primary side (one steam generator model with bypass 
valve flow and thermal characteristics). 

(5) Automatic primary coolant flow control (both total and loop flow 
controllers). 

(6) Automatic primary coolant temperature control with inlet and outlet 
temperature measurement lags and associated control devices. 

Secondary Loop: 

(1) Steam generator secondary side. 

(2) Main steam header. 

(3) Dump condenser and butterfly control valve. 

(4) Surge tank (including auxiliary standpipe volumes). 

(5) Condenser header and pumping system. 

(6) Steam generator level control system. 

(7) Main header pressure control system. 

(8) Surge tank level and inventory for normal fill, emergency fill, and 
emergency spill systems. 

(9) Condensate/main header differential pressure control system. 

Of the 112 equations, about 12% were ordinary differential equations, 
75% were algebraic equations, 10% were integral equations, and 3% were 
transfer functions. About 40% were nonlinear equations. The detailed list of 
equations and variables is far too long to be repeated here; the equations take 
fourteen pages to list in the original reference. The interested reader will find 
the derivation of the equations carefully described and the equations them¬ 
selves clearly arranged by subsystem in the original reference. 

In order to decompose the system equations successfully, it was necessary 
to prepare carefully the occurrence matrix (of equations and variables) for 
introduction into the computer program listed in Appendix A. The major 
difficulty that faces any analyst is distinguishing between system variables and 
parameters included in the equations. Actually, several sets of system vari¬ 
ables could have been chosen depending upon whether one wished to study 
the design or simulation of the N-reactor plant. It was found that assigning 
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an output set to the equations by hand helped immensely in isolating the 
parameters in the equations and also in detecting redundant equations, which 
were presented in the article for clarity. Groups of three engineers were as¬ 
signed the task of decomposing the system by inspection, but this did not 
prove possible. 

The recorded occurrence matrix after partitioning and tearing obtained 
from the computer program listed in Appendix A for the N-reactor problem 
is shown in Appendix B. Fifty-two seconds were required to execute the pro¬ 
gram; thirty manhours were required to set up the original occurrence matrix. 
The zeros in the matrix have been replaced by dots so that the nonzero ele¬ 
ments can be more easily discerned. A box has been drawn around each set 
of rows and columns that represent an irreducible subsystem. A box included 
within another box indicates a subsystem of equations that must be solved 
simultaneously within a larger set of simultaneous equations. The column 
labeled EQN. in Appendix B is a list of the equations corresponding to the 
original sequentially numbered rows of the occurrence matrix. The second 
column, labeled VAR., is a list of the output variables corresponding to the 
adjacent equation in the column labeled EQN. Because the output set ele¬ 
ments have been arranged to lie on the main diagonal, each column represents 
the variable on the main diagonal. The variable can be identified by proceed¬ 
ing across a row from the main diagonal to the column labeled VAR. 

The matrix in Appendix B shows that the system was partitioned into 
three main blocks and nineteen subsidiary blocks, each comprising a single 
equation that could be solved for one variable in the system. The largest block 
in the system includes (a) the equations of the primary coolant loop between 
the steam generator and the reactor, and (b) the equations of the secondary 
coolant loop between the steam generator and the dump condenser. The two 
smaller blocks correspond to the equations of an injection system in the 
primary coolant loop (not shown in Fig. 19). Each loop indicated by a box 
in the occurrence matrix in Appendix B has one iterate and the last column 
(right-hand column) in the box corresponds to the iterated variable. Note 
that there are no nonzero elements above the main diagonal except in the 
columns corresponding to iterates. 

The total number of tears in the entire system was 14. No further reduc¬ 
tions in the number of tears could be found by inspection of the matrix. 

As a second example, consider the classic multiple effect evaporator with 
forward feed, illustrated in Fig. 20. The problem is as follows: 

A feed-forward triple-effect evaporator is to concentrate 5 % NaOH to 50 % NaOH. 

Feed enters at 60°F. Overall heat transfer coefficients are: Ui = 800, U n = 500, 

Uni — 300 Btu/(hr)(ft 2 )(°F). Steam entering steam chest I is at 125 psia, saturated 
(344°F) while the vacuum on effect number III is 1 psia (102°F). Ten tons/hr of 50% 
solution are to be produced. Find the heating areas if they are all equal. 

A graph of the flows indicates no recycle streams exist, but the system cannot 
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Fig. 20. Triple effect evaporator. 

be decomposed successfully by a linear sequence of calculations because many 
of the system equations are nonlinear. 

The steady state material and energy balances for the evaporator are 
listed in Table VI and VII, and the notation in Table VIII. Table IX lists 
the enthalpy relationships for the various streams as well as the boiling point 
versus pressure and concentration relationships in functional form for NaOH 
solutions and pure water. The list of unknown variables and the numbers 
assigned to each is given in Table X. At this stage in the analysis there are 25 
equations and 27 unknown variables. Another pair of equations comes from 
the problem statement in which the following is given 

A i = A n = A ul 

to give a total of 27 equations in 27 unknowns. 

Table XI lists the equation numbers and the variables that appear in each 
equation. The numbers in parentheses are the assigned numbers of each 
variable. Only five of the material balances are independent, and those selected 
are designated as Eqs. 1-5 in Table XI. Altogether there are six independent 
material balances, but because the overall balance for NaOH can be solved 
directly for F, Fis removed from the role of a variable and the number of in¬ 
dependent material balances can be reduced to five. The five selected were 

(1) NaOH balance, effect II 

(2) NaOH balance, effect III 

(3) Total balance, effect I 

(4) Total balance, effect II 

(5) Total balance, effect III. 

The total balances were selected in lieu of the component water balances be¬ 
cause fewer variables were involved in the equations. The energy balances of 
Table VII form Eqs 6-12, in the same order, in Table XI. Eqs. 13-18 are the 
enthalpy relationships for the NaOH solution in each effect and the water 
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vapor in each effect in the same order as in Table IX. Equations 19-21 are 
the boiling point relationships for each effect while Eqs. 22 and 23 are the 
condensate-temperature relations for effects II and III, respectively. Equa¬ 
tions 24 and 25 are the statements that the areas should be equal. The last two 
equations are the enthalpy versus pressure relationships for the condensate 
from the steam chests of effects II and III, respectively. Values of certain of 
the variables as specified in the problem statement, such as the steam tem¬ 
perature, the vapor temperature from the last evaporator, etc., have been 
substituted directly into the system equations. 

The occurrence matrix for the evaporator problem was formed and ana¬ 
lyzed using the computer program listed in Appendix A. Figure 21 is the 
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13 
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Fig. 21. Reordered occurrence matrix for the triple effect evaporator system illustrated 
in Fig. 20. 
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TABLE VI 
Material Balances 


NaOH 

Effect I 

(0.05) (F) = <ii, b L, 

Effect II 

v)\ b L\ =i u>i La 

Effect III 

co„ t £„ = (0.50)(20 > 000) 

Over-all 

(0.05) (F) = (0.50)(20,000) 

Water 

Effect I 

(0.95) (F) = <o, w L, + V, 

Effect II 

(x)\ w L\ — ojh w L\\ + Vu 

Effect III 

(on Lu w - (0.50)(20,000) + Vni 

Over-all 

(0.95) (F) = (0.50)(20,000 + V,) 

+ Vn + Vu 

Total 

Effect I 

F = F,+ V, 

Effect II 

Li = Lu + Vu 

Effect III 

Lu = Lin + ii 

Over-all 

F=L llt + V,+ V„ + V„, 


TABLE VII 
Energy Balances 


Effect I 

(1191.0)5 + (26)(F) = (315.6)5 + H Lx L x + H Vl V y 
Effect II 

H Vx V x + H hx U * Hd\\V\ + H Ln L n + Hy n V n 
Effect III 

H Vn V u + H Ln L n = H Dm V n + ff Lm (20fi00) + H Vm V nl 
Condenser 

Hy m V U l = G+(6 8)(F„.) 

Steam chest I 

5(1191.0 - 315.6) = (800)(A,)(344 - T,) 

Steam chest II 

V,(Hv, - Hou) = (500 )(^„)(r DlI - T n ) 

Steam chest III 

V n (H Vu - H d ,„) = (300)(/l III )(r DlII - 7i„) 
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TABLE VIII 
Notation 


A 

Area of heat transfer in steam chest 

F 

Mass flow rate of feed 

H 

Enthalpy per unit mass 

L 

Mass flow rate of solution 

Q 

Condenser duty (rate of energy removed) 

P 

Pressure 

s 

Mass flow rate of steam 

T 

Temperature 

V 

Mass flow rate of vapor 

Greek 

co b 

Mass fraction NaOH in solution L 

w w 

Mass fraction water in solution L 

Subscripts 

I, II, III 

Effect number 

D 

Condensate 

L 

Liquid 

V 

Vapor 


TABLE IX 

Physical Property Relationships 


NaOH solution enthalpies 
H Li =/ 1 (7’„ oV) 

H Lu = w b u ) Available in the form of charts 

H Lul =/uii(Tin, o»iii) 

Water vapor enthalpies 
H Vl =g,(T„P,) 

Hvn = g u (T u , Pn) Superheated steam from the steam tables 
Hv\u = £7iii(7iii, 1 psia) 

NaOH solution boiling points 
r, =k l (P li oj l b ) 

Tu = ku(Piu ton) Duhring charts and steam tables 
Tiii — km (1 psia, 0.5) 

Condensate temperatures 

T Dl i = ffi(Pi) Saturated steam from the steam tables 

T Dlll =9n{P\\) 

Condensate enthalpies 

H Dn — h u (T Dn ) Saturated steam from the steam tables 

Ham ~ ^iiiCLdh,) 
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TABLE X 


Unknown System Variables and Assigned Numbers 


Variable number 

Variable 

Variable number 

Variable 

1 

OJ t b 

14 

H v\xx 

2 

(X)u 

15 

Hon 

3 

L t 

16 

Hd\u 

4 

■Ln 

17 

Q 

5 

V, 

18 

T> 

6 

Vn 

19 

T n 

7 

V,n 

20 

T m 

8 

s 

21 

T Dll 

9 

Hu 

22 

Tdxxx 

10 

Hl u 

23 

Ax 

11 

Hl ui 

24 

Axx 

12 

Hy , 

25 

Am 

13 

Hv n 

26 

Px 



27 

Pi i 


reordered occurrence matrix obtained by the program. In order to obtain this 
sequence of equations it was necessary to specify that none of the enthalpies 
could be chosen as output variables of Eqs. 6-9, 11, 12, and only the enthal¬ 
pies could be chosen as output variables in Eqs. 13-18 and Eqs. 26 and 27 
(by prohibiting all the other variables from being selected). Specifying these 
prohibited variables was necessary because of the difficulty of determining 
temperatures, pressures, or concentrations from the steam tables or Duhring 
plots when the enthalpy is given. If these prohibited variables were not speci¬ 
fied, another precedence ordering of the system equations was obtained in 
which an enthalpy was assigned as the output variable of one of the energy 
balances. In attempting to solve the system in the designated precedence order 
in this latter case, the value of the enthalpy calculated in the energy balance 
was not within the range of values given in the steam tables unless the assumed 
values of the iterates were quite close to the correct values. 

The classical procedure to solve the triple effect evaporator problem is as 
follows: 

(1) Assume values for > ^ii > ^iii . Solve the material balances in sequence 
for Lj, L n , cDj!. 

(2) Assume values for T x , T n . Find the values of Ton and To m from 
the Duhring charts. 

(3) Solve the energy balances simultaneously for S , F,, F n . Use the values 
of V x , V n , and the material balances to check the assumptions for step 1; 
iterate, if necessary. 
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TABLE XI 


Equations and Unknown Variables for Triple Effect Evaporator 


Equation number 

Variables 0 

1 

<(1),L(3), wf,(2), £„(4) 

2 

o>?,(2), £„(4) 

3 

LA 3), K,(5) 

4 

LA 3). L, .(4), K„(6) 

5 

£n(4), Lin(7) 

6 

5(8), H Ll ( 9), LA 3), HvA 12), H(5) 

7 

HrA 12), K,(5), H Ll ( 9), /7 0 „( 15), ft,,(10), L,A 4), 

Hy,A 13), r„(6), 1,(3) 

8 

ft„( 13), F„(6), ft„(10), L„(4), ft„,(16), K„(6), 
ft,,,(11), ft,„(14), K,„(7) 

9 

ft,,,(14), K„,(7), 2(17) 

10 

5(8), ,4,(23), 71(18) 

11 

K,(5), ft,(12), ft,,(15), A u (24), r Dll (21), 71,(19) 

12 

K„(6), ft„(13), ft,,,(16), ,4„,(25), r DlI1 (22), 

r„,( 20 ) 

13 

ft,(9), 71(18), w, 4 5 6 (l) 

14 

ft„(10), 71,(19), oj b u (2) 

15 

ft,„(ii), r„,(20) 

16 

ft ,(12), TA 18), P,(26) 

17 

ft„(13), 71,(19), P„(27) 

18 

ft,„(14), r,„(20) 

19 

71(18), P,(26), 6u,‘(l) 

20 

71,(19), P„(27), tof,(2) 

21 

71„(20) 

22 

7d,,(21), P,(26) 

23 

7d,„(22), P,A 27) 

24 

.4,(23), .4„(24) 

25 

A,A24), A„A 25) 

26 

ft„(15), 7’ CI1 (21) 

27 

ft,,, (16), r c ,„(22) 


° The numbers in parentheses are the assigned variable numbers. 


(4) Solve the steam chest equations in sequence for A l9 A u ,A m . 

(5) Check the values of A ] , A u , A m to see if they are equal; if not, start 
with step 1 or step 2 again. 

(6) Calculate Q from the condenser equation. 

The classical method of solution is effective and the iterates are quite 
insensitive to the initial assumptions, if properly chosen as described in books 
on unit operations. No claim is made that the precedence scheme of Fig. 21 
is either better or quicker than the classical method. However, the procedure 
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can be applied automatically to other more complicated problems for which 
calculational schemes have not yet been developed. For large scale systems 
it is quite difficult to develop precedence schemes by inspection alone, and the 
algorithms presented here should be quite helpful in determining the best 
precedence schemes. 
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Appendix A. Computer Program for Precedence Ordering 


PROGRAM MATRIX(OUTPtJT, INPUT) 

PROGRAM MATRIX FOR DECOMPOSITION OF LARGE SCALE SYSTEMS 

THE OBJECTIVE OF THE PROGRAM IS TO ARRANGE IN PRECEDENCE 
ORDER FOR SOLUTION A SYSTEM OF N EQUATIONS AND N VARIABLES, 

INPUT DATA CARDS 

GROUPS OF FIELD DESCRIPTION OF DATA 

DATA CARDS COLUMNS FORMAT IN FIELD 


1 1-3 13 NUMBER OF EQUATIONS IN THE SYSTEM, 

2 1-3,5-7, 20(13, THE NONZERO ELEMENTS OF THE OCCUR- 

9-11, ,, IX) RENCE MATRIX ARE GIVEN IN TERMS OF ORDER 

PAIRS ALTERNATING BETWEEN EQUATION NUMBER 
AND VARIABLE NUMBER. THE ORIGINAL AS¬ 
SIGNMENT OF EQUATION AND VARIABLE NUMBERS 
ARE ARBITRARY. ENTER IN COLUMNS 1-3 AN EQ¬ 
UATION NUMBER. ENTER IN COLUMNS 5-7 
ONE OF THE VARIABLE NUMBERS IN THE 
SELECTED EQUATION. IN COLUMNS 9-11 
ENTER AN EQUATION NUMBER (THE SAME EQ¬ 
UATION OR ANOTHER) AND IN COLUMNS 13- 
15 ENTER THE NUMBER OF ONE OF THE VAR¬ 
IABLES IN THE SPECIFIED EQUATION. 

CONTINUE INTRODUCING EQUATION NUMBERS 
AND VARIABLE NUMBERS UNTIL ALL THE NONZERO 
ELEMENTS HAVE BEEN ACCOUNTED FOR. EACH CARD 
MUST BE FILLED COMPLETELY, EXCEPT THE 
LAST CARD WHICH MUST HAVE TWO BLANK 
FIELDS, OR THE READING WILL STOP. 

NOTE THERE IS A SPACE BETWEEN FIELDS. 

EVERY NUMBER MUST BE RIGHT JUSTIFIED 
IN THE FIELO. THE LAST CARD MUST CONTAIN AT 
LEAST TWO BLANK FIELDS FOLLOWING THE 
DATA. THIS MAY NECESSITATE ADDING A BLANK 
CARD AT THE END OF THE DATA. 

3 1-3,5-7, 20(13, THESE DATA CARDS READ IN THOSE VARIABLES 

9-11, •• IX) WHICH ARE PROHIBITED FROM BEING ASSIGN¬ 
ED AS OUTPUT VARIABLES IN A PARTICULAR 
EQUATION. THE DATA MUST BE PRESENTED 

IN PAIRS AS ON DATA CARDS IN GROUP NO. 2. 
ENTER IN THE FIRST FIELD OF THE PAIR AN 
EQUATION NUMBER AND IN THE SECOND FIELD 
THE NUMBER OF A VARIABLE IN THE EQUATION 
WHICH CANNOT BE ASSIGNED AS THE OUTPUT 
VARIABLE OF THAT EQUATION. REPEAT FOR 
ADDITIONAL PROHIBITED VARIABLES. EVERY 
FIELD MUST BE FILLED ON EACH CARD EXCEPT 
THE LAST DATA CARD WHICH MUST HAVE AT 
LEAST TWO BLANK FIELDS FOLLOWING THE 
DATA. THIS MAY NECESSITATE ADDING A BLANK 
CARD AT THE END OF THE DATA. 

4 1-3,5-7, 20(13, THESE DATA CARDS READ IN ANY VARIABLES 

9-11, .. IX) WHICH ARE PROHIBITED FROM BEING CHOSEN 

AS ITERATES. IN THE FIRST FIELD ENTER 
A VARIABLE NUMBER WHICH SHOULD NOT BE 
CHOSEN AS AN ITERATE, IN THE SECOND 
FIELD ENTER ANOTHER VARIABLE NUMBER WHICH 
SHOULD NOT BE AN ITERATE. AT LEAST ONE 
OF THESE CARDS IS REQUIRED, EVEN IF NO 
VARIABLES ARE TO BE EXCLUDED AS ITERATES. 
THIS MAY NECESSITATE ADDING A BLANK CARD 
AS DATA. 
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C**»* PROGRAM OUTPUT 

C*««« ■■ --- ——- 

c###* THE OUTPUT FROM THIS PROGRAM IS THE FINAL REORDERED OCCURRENCE 
C**## MATRIX OF THE SYSTEM. 

C**»* 

C*#** FORM OF THE OUTPUT WILL APPEAR AS IN THE FOLLOWING SAMPLE 


c**#* 

OUTPUT MATRIX 

(X.X) 

c#**# 

EQNS. 

VARS. 


c«*»* 

3 

5 

1.1 

c**#* 

2 

1 

• 1. 

C#tt# 

1 

2 

1.1...1 

c**** 

7 

3 

•1.1... 

c**»* 

6 

4 

11111.. 

c***# 

4 

6 

1..1.1. 

c*»»# 

c**** 

5 

7 

. . . 1 . . 1 


C**** THE PERIODS IN THE OCCURRENCE MATRIX INDICATE ZEROS FOR EASE OF 
C#*#* READING. THE MATRIX OUTPUT SET IS ON THE MAIN DIAGONAL. AND THE ROWS 
C**#* ARE IN THE FINAL PRECEDENCE ORDER. THE ELEMENTS IN A MATRIX ROW ARE 
C«*#« THOSF VARIABLES ASSOCIATED WITH THE EQUATION GIVEN UNDER THE 
C**## COLUMN HEADED EQNS- THE COLUMN HEADED VARS. IDENTIFIES THE OUTPUT 
C#*#* VARIABLE FOR THAT EQUATION. BY TRANSPOSING THE VECTOR OF NUMBERS 
£**«* UNDER THE COLUMN VARS. AND PLACING IT ACROSS THE TQP OF THE MATRIX!X.X) 
C*#** YOU CAN IDENTIFY EACH OF THE VARIABLES BY NUMBER. EACH COLUMN WITH 
C**«* NONZERO ELEMENTS ABOVE THE MAIN DIAGONAL REPRESENTS AN ITERATED 
C*#** VARIABLE. 

c**#* 

c#**# if THERE ARE MORE THAN 112 EQUATIONS IN THE SYSTEM. ONLY 112 COL- 
C#**# UMNS OF THE MATRIX WILL BE PRINTED ON EACH PAGE. THEN THE MATRIX 
C**## SHOULD BE PUT TOGETHER AS FOLLOWS. 

C«*** OUTPUT MATRIX (1*1) OUTPUT MATRIX (1.2) ... ETC. 

C*#** 

C**»* IF THE PROGRAM CANNOT FIND AN OUTPUT SET. A LIST OF EQUATION 
C#*** NUMBERS IS GIVEN WHICH COMPRISES THE SUBSET IN WHICH THE EQUATIONS 
C *##« ARE GREATER IN NUMBER THAN THE NUMBER OF VARIABLES APPEARING IN 
C**## ALL OF THE EQUATIONS OF THE SUBSET. THE PROBLEM MUST BE REVIEWED 
C**»* TO MAKE IT DETERMINATE. (IF THE NUMBER OF EQUATIONS IS LESS THAN 
C#*** THE NUMBER OF VARIABLES. THE PROGRAM WILL PROCESS THE EQUATIONS.) 

C***» 

DIMENSION K(162.162)* IND (162).JND(162)* ISTOR(665)♦IOUTST(162).INZ( 
110)«JNZ<20) 

COMMON IND.JND.N.NPO.K.IOUTST 
COMMON /A/ ISTOR. IL 
COMMON /L/ LOOP(162.11) 

COMMON /T/ LI(20) . L0(20) 

COMMON /N/ KSTOR(665).IKJ 
DIMENSION TIND(162)* JJND(162> 

C*#*# THE MAIN PROGRAM MATRIX ONLY READS THE OCCURRENCE MATRIX AND CALLS 
C*#** SUBROUTINES TO DO THE DECOMPOSITION. 

C**#* THE OCCURRENCE MATRIX IS STORED IN MATRIX K. THE MATRIX LOOP IS 
C***# USED TO STORE THE NON OUTPUT ELEMENTS OF THE EQUATIONS. THE VEC- 
C#**# TOR ISTOR IS USED TO STORE THE NONZERO ELEMENTS OF THE OCCURRENCE 
C***# MATRIX. N IS THE NUMBER OF EQUATIONS. IND AND JND ARE USED TO 
C*#** KEEP THE CORRESPONDENCE BETWEEN THE EQUATION NUMBERS AND THE ROW 
C*##* NUMBERS AND THE VARIABLE NUMBERS AND THE COLUMN NUMBERS RESPECT- 
C***# IVELY. THE VECTOR IOUTST IS USED TO STORE THE OUTPUT SET. 

1 READ 5.N 

C**## THE NUMBER OF EQUATIONS IS READ HERE. 

NPO * N +1 
6 FORMAT(13.117) 

NPT » N + 2 
DO 10 I * 1♦N 
IND(I) = I 
IOUTST(I) = 0 
JND(I) * I 
DO 10 J * l.N 
10 K(I.J) = 0 
IL * 0 

C***# THE NONZERO ELEMENTS OF THE OCCURRENCE MATRIX ARE READ HERE. 

40 READ 50 *(INZ<L) .JNZ(L).L* 1.10) 

60 FORMAT(20(13 * 1X)) 

DO 60 L * 1*10 
IFMNZ(L).EO.O) GO TO 70 
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I * IN2(L > 

J * JNZ(L) 

K(I ♦ J) = 1 

il = il *i 

C**** THE NONZERO ELEMENTS ARE STORED IN THE VECTOR ISTOR HERE. 

ISTOR IILI * 1000*1 + J 
60 CONTINUE 
GO TO 40 

C**** THE NONZERO ELEMENTS ARE ORDERED FOR EASY REFERENCE. 

70 00 80 1 = 1* IL 
DO 80 J= I * IL 

IF(lSTOR( lULE.ISTORt Jll GO TO 80 
KDUM * ISTORI 1) 

ISTOR(I) = ISTOR(J) 

ISTORU) * KDUM 
80 CONTINUE 

DO 81 I = 1»N 
DO 01 J * 1.11 
01 LOOP(I.J) * 0 
IK J « 0 

C**** THE NON OUTPUT ELEMENTS OF THE OCCURRENCE MATRIX ARE READ HERE. 

82 READ 50 » <INZ(L )*JNZ(L)*L = 1*10) 

DO 03 L * 1*10 

IFI INZ(L).FQ.O) GO TO 85 
I « INZ(L) 

IKJ * IKJ +1 

KSTOR(IKJ) « 1*1000 + JNZfL) 

C***» THE NUMBER OF NON OUTPUT ELEMENTS IN EACH EQUATION IS COMPUTED 
C#*** AND STORED IN THE 11TH COLUMN OF LOOP. 

LOOP(1.11) = LOOP(1*11) +1 
IA = LOOP(1*11) 

83 LOOP(I.IA) » JNZ(L) 

GO TO 82 

85 CONTINUE 

DO 87 I * l.IKJ 
DO 87 J » I.IKJ 

IF(KSTOR(!).LE.KSTOR(J)) GO TO 87 
KDUM * KSTOR(I) 

KSTOR(I) 8 KSTOR(J) 

KSTOR(J) * KDUm 

87 CONTINUE 

C**** THE NON OUTPUT ELEMENTS ARE REMOVED BEFORE FINDING AN OUTPUT SET. 

DO 91 I * 1 * N 

IE * LOOPH.ll) 

IFIIE.EO.O) GO TO 91 
DO 92 L « l.IE 
J * LOOP!I.L) 

92 K( I .J) »0 
91 CONTINUE 

C*#** SUBROUTINE OUTSET FINDS THE OUTPUT SET. IF NONE CAN BE FOUND FLAG 
C**»* IS SET » 1 AND THE PROGRAM TERMINATES. 

CALL OUTSET(1.N.FLAG* 1) 

IF(FLAG.NE.O) GO TO 999 

C***» THE NON OUTPUT ELEMENTS ARE RESTORED IN THE OCCURRENCE MATRIX SE¬ 
CT**** FORE PARTITIONING. 

115 DO 117 I * l.N 
IA * LOOP(I.11) 

IF(IA.EQ.O) GO TO 117 
DO 116 L * 1.IA 
JK = LOOP(I«L) 

DO 118 J * l.N 
IF(JND(J).FQ.JK) GO TO 116 
118 CONTINUE 

116 K(I *J) = 1 

117 CONTINUE 
MIN * 1 
MAX = N 
INDEX * 0 

C**** THE SUM OF THE NONZERO ELEMENTS IN THE ROWS AND COLUMNS ARE COMP- 
C**** UTED AND STORED IN THE N+l COLUMN AND ROW OF K RESPECTIVELY BY 
C***» SUMRC. 

120 CALL SUMRC(MIN.MAX) 

C**»* SUBROUTINE STEWRD PARTITIONS THE SYSTEM BY STEWARD S ALGORITHM. 
CALL STEWRD(MIN.MAX) 
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c*##* SUBROUTINE PACK RESTORES THE ORIGINAL NONZERO ELEMENTS OF THE OC- 
O*** CURRENCE MATRIX. 

CALL PACK(MIN.MAX.MTN.MAX.l) 

DO 130 J = 1 *N 
130 KIN+3.J) = o 

C***» THE VARIABLES WHICH CANNOT BE TAKEN AS ITERATES ARE READ HERE. 

13S READ 50t(JNZIL)tL*l*20) 

DO 150 L = 1.20 

TFUNZI L) .FQ.O) GO TO 155 

DO 140 J a 1*N 

IF!JND1J),FO.JNZ(L>> GO TO 145 
140 CONTINUE 
145 K f N+3 *J) = 1 
150 CONTINUE 
155 CONTINUE 

DO 200 1 = MIN.MAX 
IA = LOOPU.ll) 

IF(IA.EQ.0) GO TO 200 
DO 190 L * 1.IA 
JK = LOOPtI.L) 

DO 180 J = 1.N 
TF(JND(J).FQ.JK) GO TO 190 
180 CONTINUE 
1Q0 K U . J ) = 0 
200 CONTINUE 
250 MAX = 0 

C***» THE PARTITIONS ARE INDICATED IN THE N+2 COLUMN OF K AND THE FOL- 
C*»*» LOWING SECTION LOCATES ONE PARTITION AT A TIME FOR TEARING. 

DO 300 I = 2♦N 

IF ( K(I.NPT).GE.1000) GO TO 300 
IFOC<I.NPTl.NE.KtI-l.NPTI) MIN = I 
IF(KU.NPT) .EO.Kf 1-1 .NPTn MAX = I 

I F ( fKU-1 *NPT).EO.KI I.NPT) ) .AND. fK I 1 + 1 ,NPT ) • NE . K U . NPT ) ) > GO TO 350 
300 CONTINUE 

350 IF((INDEX.FO.O).AND.IMIN.EQ.0)) MIN = 1 
ICMD a MAX - M[N 

IFIMAX.EO.O.OR.MAX.GE.159) GO TO 500 
IF(ICMO.GT,1) GO TO 400 
K(MIN.NPT) = 1000 + MIN 
KfMAX.NPT) = 2000 + MAX 
GO TO 250 
400 CONTINUE 
INDEX = 1 

C**+* THE ROWS AND COLUMNS OF K ARE SUMMED AS BEFORE. 

CALL SUMRC(MIN.MAX) 

C*** + SUBROUTINE TEAR EXECUTES THE INITIAL PHASE OF LEDET S ALGORITHM. 
CALL TFARIMIN.MAX.L) 

C-**** THIS SECTION REORDFRS THE ROWS WITH THE CHECK EQUATIONS APPEARING 
C*#*# AS SOON AS POSSTBLF. 

DO 990 I a 1.L 

ILL - LIU) 

IUU = LOU ) 

IFUUU.LT.ILL) GO TO 990 
DO 980 J = ILL.IUU 
!F(K(NP3.JWEQ. 1 ) GO TO 980 
DO 970 IK = MIN.MAX 

97^ KUK.MAX-I + 1) = K( IK.MAX-I+I) .OR.KUK.J) 

980 CONTINUE 
990 CONTINUE 

LIIL + D = MAX-L+1 
DO 1400 IK = l.L 
I = L -IK" +1 
MXL * MAX -I 
DO 1100 rj a MIN.MXL 
J = MXL - IJ + MIN 
TEfKfMAX-1+1.J).FQ.1) GO TO 1110 
1100 CONTINUE 

mo I Fc J.LE.MAX-L ) GO TO 1180 
JDM a J “1 

00 1120 II = MTN.JOM 

rj = jom -ri + min 

IF(K(MAX-I+1.IJI.FQ.l) GO TO 1130 
1120 CONTINUE 
1130 LOU) = LO(MAX-J+11 
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I AM = MAX -J 
IPO = I +1 

DO 1150 I AC = IPO. I AM 
IFfLOUJ.GE. LOIIAC)) GO TO 1150 
LOIII * LOIIAC) 

1150 CONTINUE 

IFIIJ.GT.MAX-L) GO TO 1400 
IF! IJ + l.GT.LOf I)) LOU) = IJ +1 
GO TO 1400 

llflO IFU.GE.LI ( 1 + 1 ») GO TO 1200 
LOII) * J +1 
GO' TO 1400 

1200 DO 1220 IA = 2.L 

IF C J.GE.LI (IA-1) .AND.J.LT.LI(IAM GO TO 1230 
1220 CONTINUE 

I A a L +1 

1230 LO(I) = LO(IA-l) 

IFMA-l.EG. 1 + 1 ) GO TO 1250 
IPO * I +1 
IAM = IA -2 

DO 1240 I AC = IPO . IAM 
IFILOU ).GE.LO( IACM GO TO 1240 
LOU) = LOUAC) 

1240 CONTINUE 

1250 IF< J+l.GT.LOU U LOU) * J ♦! 

1400 CONTINUE 

ID = MIN -1 
MXL ■ MAX -L+l 
DO 1600 I a MIN.MXL 
DO 1450 IJ a l.L 
J » L-IJ +1 

IFU.NE.LOl J) ) GO TO 1450 
ID * ID +1 

II NO(ID) ■ IND(MAX-J+1) 

JJNDUD) * JNDtMAX-J+1) 

1450 CONTINUE 

IFU.EG.MXL) GO TO 1600 
ID * ID +1 
IINDUD) a INDU) 

JJNDIID) a JND ( I ) 

1600 CONTINUE 

DO 1650 I a MIN.MAX 
INDU ) a UNDID 
1660 JND(I 1 *JJND(I) 

CALL PACKIMIN.MAX.MIN.MAX.2) 

GO TO 250 
500 CONTINUE 

CALL PACK(1*N*1tN *2) 

£***• AFTER ALL OF THE TEARING IS DONE. SUBROUTINE PRNT PRINTS THE RE- 
C***» ORDERED OCCURRENCE MATRIX. 

CALL PRNT(1»N.1) 

999 CONTINUE 

C***» RETURN TO READ ANOTHER OCCURRENCE MATRIX. 

GO TO 1 
END 

C**»* 

c**»* 

SUBROUTINE PACK!MINR.MAXR.MINC.MAXC.KASE) 

C**»* THIS SUBROUTINE REPLACES THE ORIGINAL ELEMENTS IN ANY PART OF 
C###* MATRIX K. MI NR AND MAXR ARE THE FIRST AND LAST ROWS OF THE POR- 
C***« HON TO BE REPLACED. AND MINC AND MAXC ARE THE FIRST AND LAST COL- 
C#**# UMNS OF THE PORTION TO BE REPLACED. KASE INDICATES WHETHER TO RE- 
C**»* PACK ACCORDING TO THE OUTPUT SET IN OUTST OR ACCORDING TO THE 
C*»*» ORDER IN IND AND JND. 

DIMENSION Kf162.162)*IND(162).JND1162).ISTORl665).IOUTST(162).INZ( 
110).JNZtlO) 

COMMON /A/ ISTOR. IL 
COMMON IND.JND.N.NPO.K.IOUTST 
COMMON /N/ KSTOR(665).IKJ 
COMMON /L/ LOOPU62.il ) 

TRW ( I ) = ISTORUJ/IOOO 
I CL(I) = ISTORlI) -ISTORlI)/1000*1000 
DO 100 I a MI NR.MAXR 
IF{KASE.E0.2) GO TO 50 
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ji =* indui 

JNDII) a IOUTSTIJI) 

50 DO 100 J a minc.maxc 

inn K ( I * J) = 0 

DO 350 IK = MINR.MAXR 
DO 300 L a l,It. 

IR a IRWfL) 

I F ( IR - INOUKM 300*250*350 
250 JC = ICL!L) 

DO 275 JK = MINOMAXC 
IF(JC*EQ.JND(JK)I GO TO 260 
275 CONTINUE 
GO TO 300 
280 K(IK*JK) * 1 
300 CONTINUE 
350 CONTINUE 

DO 400 I = MINR *MAXR 
DO 400 J = 1*11 
400 LOOP(I*J> = 0 

IF(IKJ*EQ* 0) GO TO 600 
DO 500 I = MINR *MAXR 
IB a 0 

DO 450 J a 1,IKJ 
IA * KSTORIJ)/1000 
IF ( IA-INOIIM 450*430*500 
430 IB h IB +1 

LOOP(1*11) = IB 

LOOP(I*IB) a KSTOR(J) - KSTOR(J)/1000*1000 
450 CONTINUE 
500 CONTINUE 
600 CONTINUE 
RFTURN 
END 

C**** 

c**#* 

SUBROUTINE PRNT(MIN*MAX* IOP) 

C***# THTS SUBROUTINE PRINTS THE OCCURRENCE MATRIX* 
DIMENSION K(162*162)* TNDf162).JND(162) 

COMMON IND*JND*N*NPO*K 
NA * MAX -MIN +1 

NN = NA/112-I1/U + NA- NA/112*112)) +1 
DO 14 MMK a 1 , NN 
ID a 1 
PRINT 6 
6 FORMAT!1H1) 

IF(IOP)1 * 1*3 

1 PRINT 2*1D *MMK 

2 FORMAT! 44X* *INPUT MATRIX (**I2*****12**)*) 

PRINT 16 

16 FORMAT! 2X.*EQN.**2X**VAR.*) 

GO TO 5 

3 PRINT 4*ID.MMK 

4 FORMAT!44X• *0UTPUT MATRIX (*.12.*** * 12.*)*) 

PRINT 16 

5 CONTINUE 

JLOW aU2*(MMK - 1) + MIN 
IF(MMK - NN) 11 * 12 * 12 

12 JUP a MAX 
GO TO 13 

11 JUP = JLOW + 111 

13 00 14 I = 1 * N 

DO 20 J a JLOW * JUP 

IF!K1 I * J)*E0*1) GO TO IQ 

K ( I *J) = 1H. 

GO TO 20 
IQ KU.J) = 1H1 
20 CONTINUE 

15 JfORMAT (2X*I3*3X»I3*3X*112A1) 

PRINT 15* INDU ) *JND! I ) *!K! I *J) *JaJLOW.JUP) 

14 CONTINUE 

DO 100 I a MIN*MAX 
DO 100 J a MIN *MAX 
IF(K!I.J).FQ.1H1) GO TO 50 
K! !* J) = 0 
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GO TO 100 
*>0 K< I *J> □ 1 
100 CONTINUE 
RETURN 
END 

C##** 

c#*** 

SUBROUTINE TEAR (M!N *MAX *L) 

C***A THIS SUBROUTINE PERFORMS THE TEARING ON A PARTITION By LEDET S 
C***» ALGORITHM. 

DIMENSION K(162 * 162)*INDI162)*JND1162).ISTOR(665>♦IOUTST(162> * INZI 
1 10).JNZ120).KSETl162) 

COMMON IND*JND*N*NPO»K »IOUTST 
COMMON /!/ LK20) *LO(20) 

COMMON /L/ LOOP!162*11) 

NPT = N +2 
NP3 = N + 3 
NP4 * N +4 
ID * MIN -1 
L = 0 

C***» THE ELEMENTS OF THE N+3 COLUMN OF K ARE SET EQUAL TO THE NUMBER 
C***# OF NEW VARIABLES IN EACH ROW* 

DO 10 I * MIN *MAX 

KINP4•I) * 0 

K( I *NP3) « 0 

DO 10 J * MlNfMAX 

K(NP4 * I) = K(NP4 * I) + K<J, 1) 

10 KC I .NP3) = M I *NP3) ♦ KM.J) 

DO 12 I = MIN*MAX 
12 KSETl!) = 1C ( I *NP3 ) 

IK = 0 

C#*** IF ALL OF THE EQUATIONS HAVE BEEN REORDERED* STOP. 

1 IF(ID.GE.MAX-L) GO TO 900 
IDP = ID +1 

C***# THE ELEMENTS OF THE N+4 COLUMN OF K ARE SET EQUAL TO THE NUMBER 
C#«** OF VARIABLES IN EACH ROW WHICH ARE INCLUDED IN THE LAST OPEN LOOP 
DO 20 I = IDtMAX 
KJD * 0 

IU » LOOPII.ll) 

IF(IU.EQ.O) GO TO 19 
DO 18 I LA * 1*IU 
MXL * MAX -L+IK -1 
DO 16 JLA * ID *MXL 

IF(JNOlJLA V.EQ*LOOP<I * I LA)) GO TO 17 

16 CONTINUE 
GO TO 18 

17 KJD * KJD +1 

18 CONTINUE 

IF(KS£T(l),EQ.l»AND.KJD.EQ.1)KJO ■ 2 

19 CONTINUE 

20 KII.NP4) ■ KSET( I) ♦ KJD 
KSUM * 1 

IK » 1 

25 IF(ID.GE.MAX-L) GO TO 900 
INDEX » 1 

!DP * ID +1 

<;♦*** THE ROWS ARE SEARCHED HERE TO FIND THE ROW WHICH CONTAINS IK VAR- 
C**»* iables. 

DO 700 I * I DP. MAX 

IF(K(I.NP3)*NE.IK) GO TO 700 

C**** THIS BRANCH ALLOWS FOR CHOOSING THE EQUATION WITH THE MINIMUM 
C*»** NUMBER OF NONZERO ELEMENTS IN THE LAST LOOP. 

KOMPAR = 0 

MBL * MAX -L 

DO 29 I AC * I.MAX 

IFIKl1AC.NP3).NE.IK) GO TO 29 

I SUM * 0 

DO 26 JAC * I DP *MBL 

26 I SUM * I SUM ♦ K( IAC.JAC)*K<NP4*JAC) 

IF(ISUM.LE.KOMPAR) GO TO 29 
KOMPAR = |SUM 

CALL ROWXC(IAC * I) 

29 CONTINUE 

IF(K(T *NP4) -KSUM) 700*35.30 
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30 INDEX = 2 
GO TO 700 

C***« ROW I IS CHECKED TO SEE IF IT HAS ANY NON OUTPUT VARIABLES WHICH 
C*#** HAVE NOT YET BEEN SPECIFIED. IF ANY EXIST THE SEARCH CONTINUES. 
35 DO 45 IL * 1.10 

IF(LOOP(WIL1•EG•0) GO TO 45 
DO 40 JL » 1 * ID 

IF(JND<JL).EQ.LOOPtI.IL)► GO TO 45 
40 CONTINUE 

DO 42 JL * 1 *L 

IF(JND(MAX-JL+1).EO.LOOP{ I ♦ IL) 1 GO TO 45 
42 CONTINUE 
GO TO 700 
45 CONTINUE 

IF(IK.LT.2) GO TO 90 

C##** THIS BRANCH IS TAKEN IF A NEW LOOP IS OPENED BY TAKING ROW I. 

I KM * IK -1 
TOP = ID + t 
MXA = MAX -L 
KOUNT » 0 

DO 50 IA * I DP ♦ MXA 

IF(K(I * IA).EQ.1.AND.KINP3.IA).NE.l) KOUNT = KOUNT +1 
50 CONTINUE 

IFIKOUNT.LT•I KM) GO TO 700 
DO 53 IA = I DP .MAX 
53 KSET(IA ) = KlIA.NP3) 

C*#** THE COLUMNS OF THE ITERATES ARE PLACED AT THE END OF THE BLOCK 
c***» AND THE COLUMN OF THE OUTPUT VARIABLE PLACED NEXT IN THE SEQUENCE 
C**** THE COLUMNS REMOVED ARE SUBTRACTED FROM THE N+3 COLUMN. 

DO 60 IA s I DP . MXA 
JB = MXA +1 DP -IA 

IFIKIWJBWNE.l.OR.KINPWJBWEQ.il GO TO 60 
L - L +1 

CALL COLXCIJB.MAX-L+1) 

LHL ) = ID +1 
LO(L> * 0 

DO 55 IB = I DP. MAX 

55 KUB.NP3) * KIIB.NP3) - K ( IB .MAX-L+1) 

60 CONTINUE 

IF(KOUNT.GT.IKM) GO TO 75 
LOlLl - ID +1 
DO 70 JA = I DP. MXA 
IFIKII»JA>«NE.lt GO TO 70 

C***» THE BOOLEAN SUM OF ALL OF THE COLUMNS WHICH CAN BE ITERATES IS 
C**** COMPUTED AND PLACED IN THE PROPER COLUMN. 

DO 65 IA = I DP.MAX 
65 KUA.NP3) =• KIIA.NP31 - K(IA.JA) 

GO TO 85 
70 CONTINUE 
75 JA = MAX -L+l 
L = L -1 

LO(L) * ID +1 

DO 80 IA = ID.MAX 

80 KHA.MAX-L + 1J = KlIA.MAX-L+1) .OR.K(IA.JA) 

85 CALL COLXCIID +1*JA) 

GO TO 145 

C*### THIS BRANCH IS TAKEN IF EQUATION I HAS ONLY ONE NEW VARIABLE. 

90 MXA = MAX -L 
I DP = ID +1 
DO 100 JA = IDP.MXA 
IFIKI WJAWNE.l) GO TO 100 
C***» THE COLUMN IS SUBTRACTED FROM THE N+3 COLUMN. 

DO 95 IA = I DP.MAX 
05 KIIA.NP3) = KIIA.NP3) -KIJA.JA) 

GO TO 110 
100 CONTINUE 

C#*** THE COLUMN IS PLACED NEXT IN THE SEQUENCE. 

110 CALL COLXCI ID+WJA) 

IF(KSUM.NE.2.0R.KINP3. ID+1 WEQ.l) GO TO 145 
IFILOIL).EO.O) GO TO 125 
!LL S = LHL) 

DO 120 IA = ILL.ID 

IFIKI IA.NP4WGT.2) GO TO 145 
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120 CONTINUE 
GO TO 135 
125 ILL = LI a I 

DO 130 IA x ILL » ID 
IF(KdA.NP4).GT.2) GO TO 145 
13C CONTINUE 
135 LOCL) = ID +1 

C**»* IF THE VARIABLE CAN BE AN ITERATE THE BOOLEAN UNION OF THE COLUMN 
C**#* AND THE LAST LOOP COLUMN IS PLACED IN THE LAST LOOP COLUMN. 

DO 140 IA = IDP.MAX 

140 KMA*MAX-L + 1) = KOA.MAX-L + 1) .OR .K ( IA * I D+l ) 

145 ID * ID +1 

c»*»* row i is placed next in the sequence. 

CALL ROWXCMD.M 
KDUM = KSET (ID) 

KSET(ID) = KSETtI) 

KSET(I) = KDUM 
IA = 1 

C**#* THE REMAINING EQUATIONS ARE CHECKED TO SEE IF AN OUTPUT SET CAN 
C***« BE ASSIGNED. 

150 CALL OUTSET(ID+1*MAX*FLAG*2) 

IF{FLAG.EQ.O) GO TO 250 

C***# IF AN OUTPUT SET COULD NOT BE FOUND AND THERE IS SOME EQUATION 
C**»* WHICH CONTAINED MORE THAN ONE ITERATE THE ITERATES OF THAT EQUA¬ 
LS** TION ARE REARRANGED AND ANOTHER CHECK MADE FOR AN OUTPUT SET. 

LM x L -1 

IF( IA.GE.L) GO TO 260 
DO 240 IB = IA.LM 
IFILI (IB) .NE.LK IB+11) GO TO 240 
I DP = ID +1 
DO 160 IC = I DP * MAX 
IFIKIICtMAX-IB+1).EQ.l) GO TO 240 
160 CONTINUE 

IC = LK 18) 

IF(K(NP3»!C).EQ.l) GO TO 240 
CALL COLXC(IC.MAX-IB+1) 

IC * IB +1 

165 IF(LI (IO.NE.LI { IC+1 ) .OR.IC.EG.L) GO TO 170 
IC * IC +1 
GO TO 165 
170 IL = LI IIC) 

IU = LO(IC) 

DO 185 JC ; IL » IU 
IF(K(NP3*JC).EQ.l) GO TO 185 
DO 180 IE x MIN * MAX 

180 K(IE * MAX -IC +1) * K+IEfMAX-IC + 1)•OR•K(I£♦JC ) 

105 CONTINUE 
IA * IC 
GO TO 150 
240 CONTINUE 

C»*** THE ITERATES COULD NOT BE REARRANGED AND EQUATION I CANNOT BE PUT 
C***» NEXT TN THE SEQUENCE. 

GO TO 260 

C***# IF EQUATION I CAN BE PUT NEXT IN THE SEQUENCE* RETURN TO SEARCH 
C**«* ROWS AGAIN. 

250 IFdK.GT.ll 1, 255 
255 KSUM = 1 
GO TO 25 

C**#* THIS BRANCH IS TAKEN WHEN AN OUTPUT SET COULD NOT BE FOUND AND 
C***« THE SYSTEM IS RESTORED TO THE WAY IT WAS BEFORE EQUATION I WAS 

c**»* chosen. 

260 IF(IK.GT.l) GO TO 290 

IF(KSUM.NE.2.0R.K(NP3* ID).EQ.l) GO TO 200 

IFtLOfL).NE.ID) GO TO 280 

CALL PACKfMlNfMAX *MAX-L + 1*MAX-L + 1» 2) 

KD = ID 

265 KD = KD -1 

IF(K(NP3*KD).EQ.l) GO TO 265 
LO(U = 0 

IF(KD.LT.LI(L)) GO TO 280 
LO(L) * KD 
IL * Li(LI 
IU * LO(L) 

DO 275 JA * ILflU 
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IF(K(NP3.JA).EQ.l) GO TO 275 
DO 270 IA . MIN*MAX 

270 M IA.MAX-L+1) * K 1 IA * MAX-L + 1) .OR.MIA.JA) 

275 CONTINUE 
280 ID = ID -1 

DO 285 IA = ID .MAX 

285 MIA.NP3) * MIA.NP3) + MIA .ID + 1 > 

GO TO 700 
290 JA = ID 
I KM * IK 
I DM = ID -1 

300 DO 310 IB = I DM.MAX 

310 MIB.NP3) * MIB.NP3) + MIB.JA) 

I KM = I KM -1 

IFfIKM.LE.O) GO TO 320 

JA = MAX-L+IKM 

CALL PACK(MIN.MAX.JA.JA.2) 

GO TO 300 
320 L = L -IK +1 
ID = ID -1 
700 CONTINUE 

C**#* NO MORE EQUATIONS CAN BE FOUND AT THIS POINT WHICH CONTAIN IK NEW 
C***+ VARIABLES AND KSUM VARIABLES IN THE LAST OPEN LOOP. IF INDEX = 2 
C***» THERE ARE SOME ROWS LEFT WITH IK NEW VARIABLES BUT THEY HAVE MORE 
C***» THAN KSUM VARIABLES IN THE LAST OPEN LOOP. 

IF(INDEX.NE.2) GO TO 710 
KSUM = KSUM +1 

IFIKSUM.GT.(MAX-MIN+1M 800.25 
710 IFf IK.GT.IMAX-MIN+1M GO TO 800 
KSUM = 1 
IK * IK +1 
GO TO 25 

eon PRINT 810 , IND(MIN) 

810 FORMAT(* NO TEAR COULD BE FOUND FOR THE BLOCK BEGINNING WITH EQUAT 
11 ON*.13.* WITH THE NON ITERATES SPECIFIED. • ) 

900 CONTINUE 

C**#* THE LAST OUTPUT SET ASSIGNMENT IS OBTAINED FROM THE N+l COLUMN 
C*#** AND THE CHECK EQUATIONS ARE ORDERED PROPERLY. 

DO 915 I = 1 * L 

II = MAX -1+1 

I F ( K ( II .NP01.GE.10000) K(II.NPO) = MII.NPOl - 10000 
IFIKIII .NPO)-GE.1000 ) MII.NPOl = MII.NPO) - 1000 

915 CONTINUE 

DO 916 1 = 1»L 

II = MAX -1+1 
DO 913 J = 1*L 
JJ = MAX-J+1 

IF(MJJ*NPO).NE.II) GO TO 913 
CALL ROWXCIII.JJ) 

GO TO 916 
91 * CONTINUE 

916 CONTINUE 

DO 910 I = MIN.MAX 
910 MI.NPT) = I 

LI(L+l) = MAX +1 

DO 950 I * l.L 

CALL REORD (I.MIN.MAX) 

950 CONTINUE 

K(MIN.NPT) * 1000 +MIN 
KIMAX.NPT) = 2000 + MAX 
CALL PACK(MIN.MAX.MIN.MAX.2) 

1010 RETURN 
END 

C**»* 

c***# 

SUBROUTINE REORD(IL*MIN.MAX) 

C#*** THIS SUBROUTINE REORDERS LOOP IL SO THAT THE PROPER ITERATE IS 
C»*** PLACED IN THE LOOP COLUMN AND THE OUTPUT VARIABLES ARE REASSIGNED 

c***# properly. 

DIMENSION K(162.162).IND(162).JND(162).ISTORC665).lOUTSTf162).IN2< 
1 10) .JNZI20).KSETI162) 

COMMON IND.JND.N.NPO *K.I OUT ST 
COMMON /T/ LM20).LO(20) 

DIMENSION !IND(162).JJNDI162) 
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NP3 « N +3 
INDL = LIIIU 
JNDL x LI(IL) -1 
JLL = LII ID 
JUU = LO(IL) 

IF(LO(IL J .EQ.O I GO TO 900 
00 20 J x JLL.JUU 

IF<K(MAX-IL+1.J).EQ.1.AND.K(NP3.J).NE.1> GO TO 30 
20 CONTINUE 
GO TO 900 

30 JJND(MAX-IL+1) = JND(J) 

IIND(JLL) x I NO(J) 

JM = J -1 
ITER = J 

AO IFIJM.LT.JLL) GO TO 100 
DO SO JK x JLL *JM 
IF ( K (J*JK)•E0•1) GO TO 60 
50 CONTINUE 
GO TO 100 
60 JNDL * JNDL +1 
INDL = INDL +1 
IIND(INDL) = IND(JK) 

JJND( JNDL ) * JND(JK) 

JM x JK -1 
J = JK 
GO TO 40 

100 JNDL = JNDL «-l 

JJND(JNDL) = JND(MAX-IL+l) 

JNDfMAX-IL+1) = JJND(MAX-IL+l) 

DO ISO I = JLL *JUU 
K(JLL * I) a K(NP3*I) 

K(NP3.I) x 0 

IF(I.EO.ITER) GO TO ISO 

KOMPAR = JND(I) 

DO 120 J * JLL.JNDL 
IFCKOMPAR.EO.JJNDIJ)) GO TO 150 
120 CONTINUE 

JNDL * JNDL +1 
JJNDIJNDL) = KOMPAR 
IIND(JNDL) * INDII) 

150 CONTINUE 

DO 175 I = JLL.JUU 
INDU) * IINDM) 

IFIKIJLL.I).NE.l) GO TO 175 
KOMPAR » JND(I) 

DO 160 J x JLL.JUU 
1F C JJNDIJ).EO.KOMPAR) GO TO 170 
160 CONTINUE 
170 KINP3.J) * 1 
175 JND(I) * JJND(I) 

900 RETURN 
END 

C***# 

c**»* 

SUBROUTINE OUTSET<MIN>MAX.FLAG.KASE) 

£«*«« THIS subroutine finds an output set for rows and columns from min 
C***» NO MAX. FLAG indicates whether an output set could be found. 

c*#** KASE IS AN INDICATION TO THE SUBROUTINE WHETHER THE OUTPUT SET 
C*#** SHOULD BE PLACED ON THE MAIN DIAGONAL. 

DIMENSION K(162.162)*IND(162).JND(162).IOUTSTI162) 

COMMON IND.JND.N.NPO.K*IOUTST 
303 FORMAT(///12//) 

flag = 0.0 

DO 50 I =MIN.MAX 
K(NPO.I) « 0 
50 K(I.NPO) x 0 
100 CONTINUE 

C**#* A PHI IS PLACED IN A ROW OR A COLUMN TO INDICATE AN ASSIGNMENT OF 
C**»* A COLUMN TO A ROW. 

C**## CHOOSE ANY COLUMN IN WHICH NO PHI EXlxTS — COL. J. 

DO 150 JxMIN.MAX 
IF IK(NPO.J).LE.1000)200.150 
150 CONTINUE 
IL = 150 
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GO TO 1500 

C*»** IN COL. J FIND A NONZERO ELEMENT IN AN UNTAGGED ROW. 

C*#** FIRST LOOIC FOR A ROW WITH NO PHI 
200 DO 250 I » MIN*MAX 

IF ( K(I,NPO) .LT. 1*000 .AND. MI.Jj . GT . 0 ) 500 * 250 

250 CONTINUE 

C#*#* ip NO NONZERO ELEMENT COULD BE FOUND IN A ROW WITHOUT A PHI* THEN 
C*»** LOOK FOR ONE WITH A PHI ALREADY IN IT. 

DO 275 I *MIN*MAX 

IFf X(I,NPO) .LT.10000.AND. KU*J) .GT•0)500*275 

275 CONTINUE 

C**»* if NO NONZERO ELEMENT IN AN UNTAGGED COLUMN COULD BE FOUND TAG 
C***» THIS COLUMN. 

IF(XINPO*J).GE.10000)400,300 
300 X{NPO*J) * X(NPO *J) +10000 

C*+»# FIND A TAGGED COLUMN CONTAINING A NONZERO ELEMENT IN AN UNTAGGED 

C**#* ROW. 

C***# FIRST LOOX FOR A ROW WITH NO PHI. 

400 DO 450 J = MIN*MAX 

IFfX(NPO*JI.GE.10000)425*450 
425 DO 440 I^MIN.MAX 

IF { X { I *NPOWLT.lOOO .AND. X ( ! *J) .GT.0) 1000 ♦ 440 
440 CONTINUE 
450 CONTINUE 

C+*** if NO NONZERO ELEMENT COULD BE FOUND IN A ROW WITHOUT A PHI* THEN 
C + + ** LOOX FOR ONE WITH A PHI ALREADY IN IT. 

DO 475 J*MIN*MAX 
IF (X fNPO*J).GE.l0000>455*475 
455 DO 445 I *MIN*MAX 

IF(XU.NPO).LT.10000.AND.K<I *J).GT.0)1000*445 
445 CONTINUE 
475 CONTINUE 

C*+#* IF NONE COULD BE FOUND GO TO STEP 1200. 

GO TO 1200 

C#**» FIND ANOTHER NONZERO ELEMENT IN COL. J WHICH IS IN AN UNTAGGED ROW 
C***+ AND PUT PHI IN IT AND ERASE OLO PHI, 

1000 LY * X(NPO*J) -11000 

IF(X(LY*NPO).LT.10000)1025*1050 
1025 X(LY.NPO) * 0 
GO TO 1075 

1050 XILY.NPO) * 10000 
C»»* + TAG COLUMN J 
1075 X{NPO *JI * 11000 + I 
C***» SAVE OLD POSITION OF PHI IN ROW I* IF ANY. 

XDUM * XU,NPO) 

C***# PUT NEW POSITION OF PHI IN ROW I AND TAG IT. 

X(I.NPO) - 11000 +J 
GO TO 900 

C**»* TAG COLUMN J 

500 X(NPO.J) * 11000 +1 
C+*** SAVE OLD POSITION OF PHI. 

XDUM s XU,NPO) 

C***# PUT NEW POSITION OF PHI IN ROW I AND TAG IT. 

X( I *NPO) » 11000 +J 

C**#* IF ANOTHER PHI EXISTS IN ROW I ERASE IT. 

900 IF{XDUM.GT.10000)XDUM * XDUM - 10000 
IF (XDUM.GT.1000)XDUM*KDUM -1000 
IFfXDUM.GT.0)600.800 

C**#* CONSIDER THE COLUMN IN WHICH THE PHI HAS JUST BEEN ERASED. 

600 J= XDUM 
XDUM - 0 

IF(X(NPO*JJ.GT.10000)625*650 
625 X(NPO,J) • 10000 
GO TO 200 
650 X(NPO * J) • 0 
GO TO 200 
600 XDUM * 0 

c#*+* REMOVF ALL TAGS ON ROWS AND COLUMNS. 

DO 120 I*MIN * MAX 
IFf K f NPO *U.GE. 10000 U05*1I5 
105 X f NPO *!) * X(NPO.I) -10000 
115 IF(XU»NP0).GE.10000) 117*120 
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117 K ( I *NPO) = K(I.NPO) -10000 

120 CONTINUE 

700 00 750 I=MIN.MAX 

IF r K{I.NPOWLT*1000.OR.KtNPO*I).LT.1000)100.750 
750 CONTINUE 

C**#* THE OUTPUT SET ELEMENTS ARE PLACED IN IOUTST AND THE COLUMNS PER- 
C*»** MUTED TO PUT THE OUTPUT SET ON THE MAIN DIAGONAL. 

1100 IFIKASE.EQ.2 I GO TO 70 
PRINT 1111 
nil FORMAT I 1 H 1 ) 

PRINT 1120 

1120 FORMAT (10X.//34H AN OUTPUT SET HAS BEEN OBTAINED. 

DO 1150 I »MIN.MAX 

IF(K(I*NPO)*GT•10000)K(I*NPO) * Ktl.NPO) -10000 
IF(X( I .NPOI«GT.1000 JK{I*NPOI = Ktl.NPO) -1000 
10 « K(I.NPO) 

IOUTST(I) a JNDt10) 

1175 FORMAT( 14X. 13* 14X. 13) 

1150 CONTINUE 

DO 30 I*MIN.MAX 
DO 10 J*MIN*MAX 
IF(JND(J).EQ.IOUTST(I)>20*10 
10 CONTINUE 
20 CALL COLXC(I*J) 

30 CONTINUE 
70 RETURN 

C**«* IF AN OUTPUT SET COULD NOT BE FOUND THE EQUATIONS WITH FEWER VAR- 
C***» IABLES THAN EQUATIONS ARE PRINTED OUT IF KASE * 1. 

1200 IF tKASE.EQ.2) GO TO 1260 
PRINT 1220 

1220 FORMAT(10X *//35H AN OUTPUT SET COULD NOT BE FOUND. /10X.70H TH 

IE FOLLOWING EQUATIONS CONTAIN FEWER VARIABLES THAN EQUATIONS. 

2 /10X*9H EQUATIONS ) 

DO 1250 I*M1N*MAX 
IF(KM *NPO) .LT. 10000) 1225* 1250 
1225 PRINT 1230*1 
1230 FORMAT (14X.I3) 

1250 CONTINUE 
1260 FLAG = 1.0 
RETURN 

1500 PRINT 1550.IL 

1550 FORMAT{10X.45H AN 1 LEGAL BRANCH WAS ATTEMPTED AT STATMENT *14) 

flag = 1.0 

RETURN 

END 

C**»* 

c**** 

SUBROUTINE SUMRC(MIN*MAX) 

C**»* THIS SUBROUTINE SUMS EACH ROW AND COLUMN AND PLACES THE RESULT IN 
£*••* THE N +1 COLUMN AND ROW RESPECTIVELY. 

DIMENSION K(162.162)*IND(162)*JNDt162) 

COMMON !ND.JND*N*NPO*K 
DO 100 IsMIN.MAX 
K(I*NPOI * 0 
DO 100 JrMIN.MAX 

100 K(I.NPO) « K<I*NPO) + K(I.J) 

DO 200 J=MIN*MAX 
KtNPO.J) » 0 
DO 200 I * MIN.MAX 
200 KtNPO.J) * KtNPO.J) + K(I.J) 

RETURN 

END 

C**** 

c*»*» 

SUBROUTINE COLXC t JD*JSAVE > 

C»*** THIS SUBROUTINE EXCHANGES THE ELEMENTS OF COLUMN JD AND JSAVE IN 
C#*## MATRIX K. 

DIMENSION K<162.162).INDt162).JNDt162) 

COMMON IND.JND.N.NPO.K 
JJDUM ■ JND(JD) 

JNDt JD) * JND(JSAVE) 

JND(JSAVE) - JJDUM 
NP4 - N +4 
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DO 850 I a 1*NP4 
JDUM * K (T *JD) 

K IT * JD) * <(!* JSAVF) 

850 K( !*JSAVE) * JDUM 
RETURN 
END 

C**** 

c**»* 

SUBROUTINE ROWXC(ID.ISAVE) 

C***» THIS SUBROUTINE EXCHANGES THE ELEMENTS OF ROW ID AND ISAVE OF K 
C#*** AND THE CORRESPONDING ROWS OF LOOP. 

DIMENSION K1162*162)* IND{162)*JND(162) 

COMMON IND.JND.N.NPO.K 
COMMON /L/ LOOP(162*11) 

IDUM = IND(ID) 

INDMD) = I ND ( ISAVE ) 

IND fISAVE) = IDUM 
NP4 = N +4 
DO 515 JM a 1•NP4 
KDUM a K(ID,JM) 

K( ID*JM) * K{ISAVE * JM) 

515 KI ISAVE.JM) = KDUM 
DO 514 JM * 1*11 
LDUM = LOOP(ID *JM) 

LOOP(ID *JM) a LOOP(ISAVE.JM) 

514 LOOP(I SAVE.JM) = LDUM 
RETURN 
END 

C***» 

C***+ 

SUBROUTINE STEWRD(MIN .MAX) 

C#*«* THIS SUBROUTINE EXECUTES STEWARD S ALGORITHM FOR PARTITIONING FOR 
C**** the SUBMATRIX OF ROWS AND COLUMNS OF K FROM MIN TO MAX. 

DIMENSION KU62.162) *IND( 162) .JNDI162) . LST ( 162 ) 

COMMON IND*JND.N.NPO.K 

C#**# THE N + l COLUMN CONTAINS THE SUM OF EACH ROW. THE DIAGONAL ELE- 
C**+* MENTS (OUTPUT SET ELEMENTS) ARE REMOVFD. 

ID = MIN 

INDEX a 0 

LSTL = 0 

NPT = N+ 2 

DO 10 I = MlN.MAX 

K( I *NP0) a K(I,NPO) -K(I . I ) 

K( I .NPT) = 0 
10 KIM) =* 0 

C***» THE N+l ROW IS SEARCHED FOR COLUMNS WHICH CONTAIN NO NONZERO ELE- 
C**#* MENTS AND HAS NOT YET BEEN REMOVED FROM K• 

1 DO 50 I = MIN.MAX 

IF{(K(I.NPOUEQ.O).AND.( K ( I .NPT ) .LT.1000 )) GO TO 60 
50 CONTINUE 
INDEX = 1 
GO TO 65 

C**»* IF A ROW HAS ALL ZERO ELEMENTS, IT IS REMOVED AND INDICATED NEXT 
C***# IN THE SEQUENCE FOR SOLVING THE EQUATIONS. 

60 K( I .NPT) = 10000 + ID 
DO 62 MN a MIN,MAX 
K(MN * I ) = 0 

IFIKIMN*NPT).NE.(1000 + I)) GO TO 62 
PC ( MN, NPT ) = 10000 + ID 
62 CONTINUE 

CALL SUMRC(MIN.MAX) 

ID a ID + 1 
65 CONTINUE 

C*### THE ROWS ARE CHECKED HERE TO DETERMINE IF ALL ROWS HAVE BEEN AS- 
C***+ SIGNED AN OUTPUT COLUMN, 

DO 70 IJ = MIN,MAX 
IF(K(IJ.NPT).LT.1000IG0 TO 75 
70 CONTINUE 
GO TO 500 

75 IF(INDEX.EO.O) GO TO 1 
INDEX = 0 

C**#* ADD ROW IJ TO THE LIST. 

LSTL = LSTL ♦ 1 
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LST(LSTL) » IJ 

C**** FORM A LIST OF ROWS EACH OF WHICH FEEDS THE ONE IMMEDIATELY ABOVE 
C***» IT IN THE LIST. WHEN ANY ROW IS ENCOUNTERED TWICE BRANCH TO 200. 
THIS IS THE PATH TRACING TECHNIQUE. 

BO DO 100 J * MIN.MAX 

IFUKIIJ*J).EQ.1).AND.(K(J*NPT).LT.1000 M GO TO 110 
100 CONTINUE 
GO TO 1000 

110 IF t LSTL.LT•1 I GO TO 175 
DO 150 1*1.LSTL 
IF(LST( D.EO.JI GO TO 200 
150 CONTINUE 
175 LSTL * LSTL +1 

lst/lstli * J 

TJ * J 
GO TO 80 

C***# TAG EACH COLUMN IN THE LIST BETWEEN THE ELEMENT OF THE REPEATED 
C#*** ROW AND THE END OF THE LIST AS A LOOP. 

200 MNAI * LST(I) 

DO 220 MN * I.LSTL 
MNA * LST(MNI 
IF (MN .EQ.I> GO TO 205 
K(MNA.NPT) * 1000 +MNAI 
205 DO 208 L * MIN.MAX 

IF(<(L*NPT).NE.(1000 + MNAM GO TO 208 
K(L.NPT) • 1000 + MNAI 
208 CONTINUE 
220 CONTINUE 
1 = 1 + 1 

C***» FORM ONE ROW AND ONE COLUMN TO REPRESENT THE LOOP. THE ROW IS THE 
C***# BOOLEAN UNION OF ALL ROWS IN THE LOOP AND THE COLUMN IS THE BOO- 
C***» LEAN UNION OF ALL THE COLUMNS IN THE LOOP. 

DO 217 MN * I.LSTL 
MNA * LST(MN) 

DO 217 J * MIN.MAX 
K ( MNAI * J ) « KIMNAI.J J.OR.K<MNA.J) 

X(J.MNAI) * (C ( J »MNAT ) .OR . X ( J.MNA ) 

K(MNA.J) * 0 
217 PC(J.MNA) * 0 

K (MNAI.MNAI ) * 0 
CALL SUMRCIMIN.MAX* 

DO 250 I* MIN.MAX 
250 LST(I) = 0 
LSTL « 0 
GO TO 65 

<;*•** THE TAGS ARE REMOVED FROM THE ROWS AND COLUMNS HERE. 

500 DO 600 I * MIN.MAX 

IF(Kf I.NPTl.GT.lOOOOHCd.NPT) = K(I.NPT) -10000 
IF(K( T.NPTl.GT.lOOO JMI.NPTI * K(I.NPT) -1000 
K(I.NPO) * K(I.NPT > 

Kl I .1) * I NO(I) 

600 CONTINUE 

ID * ID - 1 
IN * MIN 

C**+* GROUP THE ROWS AND COLUMNS OF EACH LOOP TOGETHER AND ORDER THEM IN 
C*#** THE CORRECT PRECEDENCE ORDER. 

DO 750 I * MIN.ID 
DO 700 J a MIN.MAX 
IF(K(J*NPT).NE.I) GO TO 700 
IF(IN.EO.JI GO TO 6Q0 
CALL ROWXC(IN.J) 

690 IN * IN +1 
700 CONTINUE 
750 CONTINUE 
GO TO 1500 

1000 PRINT 1001 

1001 FORMAT (38H AN ILLEGAL BRANCH WAS ATTEMPTED ) 

1500 RETURN 

END 
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Nomenclature 2 


0/J 

An element of matrix A 

n 

The number of equations or variables 

A 

Adjacency matrix 


in a system 

A 

Boolean matrix in general 

P 

Adjacency matrix of information 

bu 

An element of matrix B 


flow among the loops in a system of 

B 

Boolean matrix in general 


equations 

Cl] 

An element of matrix C 

nj 

An element of matrix R 

C 

Information matrix 

r k 

' ij 

Element of the kth power of matrix R 

c 

Boolean matrix in general 

R 

Adjacency matrix 

E 

A set of edges in a graph 

r * 

r U 

Element of R* 

f 

Function in general 

R* 

Reachability matrix 

G 

A digraph 

SlJ 

An element of matrix S 

H 

A digraph 

S 

Occurrence matrix 

i 

Index for a row of a matrix 

T 

Information matrix 

j 

Index for a column of a matrix 

X 

System variable 

M 

Reduced occurrence matrix used in 

y 

System variable 


finding disjoint subsystems 

2 

System variable 
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I. Introduction 

Two immiscible fluids, in contact with each other, share a common surface, 
called the “interface.” Operations involving transfer of matter or of heat 
across an interface are very common in chemical industry. In such operations 
a large interfacial area per unit volume is necessary if the desired transfer is 
to be obtained rapidly in equipment of finite size. Three common methods of 
providing a high ratio of interfacial area to volume are now discussed. 

A. Film Method 

Here, either both the fluids are spread over an existing surface or one of 
them is spread and the other is present in bulk. Usually, the walls of a column 
are utilized for this purpose by wetting them with one of the fluids. Wetted- 
wall columns and rotating film reactors (Kll) make use of this principle to 
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bring about gas-liquid or liquid-liquid contact. Two liquids may be contacted 
cocurrently by allowing them to flow in films down opposite sides of a vertical 
wire mesh. 

The drawback of these techniques is the requirement of artificial surfaces 
for the stabilization of the films created. 

B. Rupture of Bulk Fluid 

In this method a large bulk of one of the fluids is broken into smaller 
particles inside a continuous phase of the other fluid. This can be caused 
either by an artificial stirring action or by the instability of the big globule 
itself. A typical example of this method is the rupture of bubbles during their 
ascent in a stirred-tank reactor. 

C. Dispersion through Submerged Orifices 

The method of dispersion through submerged nozzles, slots, or holes is 
the simplest and hence the most common. It permits equipment of extremely 
simple design and leads to reasonably large interfacial areas. 

As the dispersed phase moves, the bubbles (gas in liquid) or drops (liquid 
in liquid or gas) may either coalesce or rupture, depending on the conditions 
existing. Sometimes packings or stirrers are introduced to facilitate these 
phenomena. When the dispersed-phase particles arrive at the top of the column, 
they coalesce and form a bulk interface with the continuous phase, across 
which transfer processes can continue. 

Some industrial operations involving bubble and drop formation are ex¬ 
traction, direct contact heat exchange, distillation, absorption, sparger re¬ 
actors, spray drying and atomization, fluidization, nucleate boiling, air lifts, 
and flotation. 

In all these operations involving bubbles and drops, three stages have to 
be studied, viz. (i) the formation of bubbles or drops, (ii) the movement of 
bubbles or drops through the continuous phase and possible coalescence 
therein, and (iii) the formation of the interface. This review is an attempt to 
understand the first of the three aspects of the study, especially in the case of 
submerged orifices. 


II. Measurement of Bubble Volume 


In any study on bubble formation, the measurement of bubble volume is 
of primary importance. The bubbles formed at a nozzle ascend through the 
liquid column and rupture at the surface of the liquid. During their ascent, 
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the bubbles may coalesce to form bigger bubbles, or may rupture, giving rise 
to smaller bubbles. In an actual operation, both these processes go on inside 
the liquid column. Thus, the bubble size in the middle of the column is not 
the same as that at the tip of the nozzle. Though approximate evaluation of 
bubble volumes at any position in the column can be made, the methods dis¬ 
cussed in this section are meant mainly for evaluating the bubble volume at 
the nozzle-tip. 

A. Direct Methods for Evaluating Bubble Volume 

Methods which yield the bubble size directly make use of photography in 
one form or another. 


1. Cinephotography 

A number of investigators (D8, H7, K18, L2) have employed cinephoto¬ 
graphy to obtain qualitative information about the phenomenon of bubble 
formation. Speeds varying from 80 frames per second (D8) to 4000 frames 
per second (H7) have been used. This method can also be used to obtain 
quantitative information. The photographs taken are projected, and their size 
is measured. As the size of the image on the film depends upon the distance 
of the nozzle from the lens, it is necessary to have a graduated scale or other 
object of known size placed by the side of the nozzle at the same distance from 
the lens as the nozzle, and visible in the photograph. Thus, when the photo¬ 
graph is projected, a contour of the bubble is obtained from which the bubble 
volume can be calculated by assuming symmetry about the vertical axis. 
Sometimes, photographs are taken simultaneously from two or three mutually 
perpendicular directions to obtain bubble contours from different angles. This 
is particularly advantageous when highly distorted bubbles are examined. 

If the reduction ratio is independently known in terms of the distance be¬ 
tween the object and the camera, then the bubble size can be known by 
measuring the size of image by means of an ocular micrometer. 

Alternatively, the camera is kept at a fixed distance, and a scale is first 
photographed. The ratio of the image size to the actual size gives the reduction 
ratio. If the camera is then positioned at the same distance and the bubble 
photographed, the actual dimension of the bubble can be obtained from the 
image size by dividing it by the reduction ratio. 

The photographic method for evaluating the bubble volume suffers from 
the disadvantage that it does not yield the volume directly but only gives the 
contour of the bubble in a single plane. The lighting used is purely a matter 
of trial and error. Assumptions have to be made regarding the symmetry of 
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the bubble about a fixed axis. Thus, the bubble volume obtained may some¬ 
times be seriously in error, particularly when the bubbles are highly distorted. 
Another disadvantage of this method is that it is very laborious and a number 
of frames must be examined to find the proper dimension of the bubble. 

There are, however, a few points which make this method an attractive 
one. 

It can be used for measuring bubble sizes not only at the nozzle tip but also 
at any position in the liquid column. Thus, even coalesced and ruptured 
bubbles can be examined by this method. 

It examines each bubble individually and does not depend on the average 
of many bubbles. Thus, if subsequent bubbles are of different sizes as some¬ 
times happens when secondary bubbles are formed, this method is the only 
choice. 

It is the only method which can give the rate of growth of the bubble with 
time; the intermediate stages of bubble formation are accurately recorded. 

It is the best method that can be used when the surface area of the bubble 
is required. 

2. X-Ray Cinephotography 

The cinephotographic method discussed above can be used only when the 
fluid in which the bubbles are formed is transparent. If the fluid is opaque, 
like some non-Newtonian fluids or fluidized beds, x-ray cinephotography has 
to be used. Rowe et al (RIO, Rll) have used this technique for studying gas 
bubbles in fluidized beds. The column in which the bubbles are formed is 
placed between the x-ray tube and the cine camera (normally 35 mm). 
Photographs up to 50 frames per second have been obtained by Rowe and 
Partridge (RIO) with an exposure time per frame of the order of 0.01 sec. 

This method has recently been employed by Botterill et al. (B6) for study¬ 
ing the behavior of bubble chains in fluidized beds. Most of the discussion 
given for the cinephotographic method is valid for the x-ray cinephotographic 
method also. 

3. y-Ray Absorption 

This method gives the variation in voidage at any position in the column. 
It consists essentially of passing a y-ray beam through the column and 
measuring the intensity of the emerging beam by its count rate. By calibration, 
the count rate is related to the voidage and hence the voidage corresponding 
to any count rate is obtained. Bloore and Botterill (B5) found the bubble di¬ 
ameter by scanning the face of the column and determining the average pro¬ 
jections of the voidage contours. The breadth of the injected disturbance was 
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considered to be the bubble diameter. The volume of the bubble could be 
determined by assuming it to be spherical at the nozzle tip. 

Baumgarten and Pigford (B2) have employed the y-ray method for their 
study of density fluctuations in fluidized beds. The method is laborious and 
time-consuming, and yields only approximate values based on a large number 
of bubbles. Because of this, the x-ray cinephotographic method is to be pre¬ 
ferred for the study of the behavior of bubbles in fluidized beds. 

B. Indirect Methods for Evaluating Bubble Volume 

In these methods the volumetric flow corrected to the nozzle tip, Q , and 
the frequency of bubble formation, /, are directly measured. The bubble 
volume is then calculated. These methods have a number of limitations. 

First, they cannot be used for the evaluation of volumes of individual 
bubbles. Only an average bubble volume is obtained. Bubble formation is 
generally a cyclic phenomenon, and for a definite flow rate in a particular 
system, the frequency and the bubble volume are time-independent. However, 
there are situations where each bubble is followed by smaller secondary 
bubbles. In such cases, the above methods cannot yield reliable values and 
photographic methods have to be resorted to. 

Second, these methods do not give any information about the shape and 
thereby the surface area of the bubble. Although Quigley et al. (Ql) have 
found from photographic studies that the area evaluated assuming sphericity 
of bubbles is not very different from that actually measured, this evidently 
cannot be true for large and highly distorted bubbles since a sphere has the 
minimum area and any distortion in shape tends to increase it. 

Even with the above-mentioned limitations, the indirect methods con¬ 
stitute the simplest way of evaluating bubble volumes from single nozzles, 
and hence are most extensively used. As these methods involve a knowledge 
of the two quantities Q and /, the ways of measuring each of them are sepa¬ 
rately discussed below. 

1. Measurement of g, the Volumetric Flow Rate 

The volumetric flow rate is measured either by cumulative devices which 
collect the bubbles for a definite period of time or by rate measuring devices. 

a. Cumulative Devices. These devices have been generally employed for 
low gas flow rates or when the gas is required to be collected for gas analysis, 
etc. Various arrangements ranging from inverted burettes to soap-film flow 
meters have been used. For higher flow rates, wet gas meters which yield 
cumulative flows have been used. Some of the devices employed particularly 
for low flow rates are given below. 
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Reservoir method: This method makes use of the displacement principle. 
Brine or any other saturated solution in which a gas has low solubility is used 
as the liquid. Gas from the column is collected in a burette from which the 
displaced liquid flows to a reservoir. As the gas collection proceeds, the gas is 
collected under increasing pressure conditions, thereby changing the flow rate 
as well as the frequency of bubble formation. In order to collect gas under 
atmospheric conditions, the levels of the liquid in the burette and the reservoir 
must always be kept equal. This requires manual adjustments. 

Kumar and Kuloor’s device (K15): This device avoids the necessity of 
continuous control, and it can collect gas samples without disturbing the flow 
conditions in the main equipment as shown in Fig. 1. The device consists of 
a graduated burette (1) for collecting the bubbles and a levelling reservoir (2). 
The stopcock (3) at the top of (1) permits the levelling liquid to fill the burette, 
and thus prepare it for bubble collection. The reservoir (2) is divided into two 
chambers by an overflow tube (4). The outer chamber is connected to this 
tube by the stopcock (5). The burette and the reservoir are connected through 
a rubber tube. The capacity of the reservoir is about one and a half times that 
of the burette. 

When the bubbles are to be collected, the burette is filled with the liquid 
by opening the stopcocks (3) and (5) and raising the reservoir (2). Then, these 



Fig. 1 . Kumar and Kuloor’s device for collecting gas samples. 
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are closed and the reservoir is placed in such a way that the top of the overflow 
tube (4) and the gas inlet (6) are at the same level. The bubbles are then col¬ 
lected under atmospheric pressure. The bubbles are collected for a known 
interval of time and the volume collected is found from the reading of the 
burette when the levels of the liquid in the burette and the reservoir are 
equalized. 

This device has been successfully employed for measuring bubble volumes 
when the chemical reaction between a gas and a liquid is being studied (V2). 
The conditions under which the gas is measured are converted to the con¬ 
ditions of pressure, temperature, etc. existing at the nozzle tip. 

Soap-film flow meter: A modified form of the soap film flow meter used 
by Krishnamurthi, Kumar, Datta, and Kuloor (K10) for collecting bubbles 
at atmospheric pressures is shown in Fig. 2. This device, which makes use of 
the movement of a soap film in a burette, can also be employed to calibrate 
low rate gas flow meters. Gas enters the apparatus from the top. A soap film 
is formed at the tip of the calibrated burette by raising a metallic loop dipped 
in soap solution. A water-seal is used to prevent the gas from escaping through 



Fig. 2. Modified form of soap film flow meter. 
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the opening provided for the rod supporting the metallic loop. As the bubbles 
go on collecting in the burette, the soap film moves along its length, permitting 
the measurement of the volume of the collected gas at any time. 

The advantages of this device are the ease of handling, the absence of 
continuous manual control and the negligible error due to mass transfer. 

Wet gas meter: Wet gas meters can be employed at high flow rates. 

b. Rate Measuring Devices. Rotameters and orifice meters are extensively 
used as rate measuring devices, although any other kind of flow meter can 
be used with equal advantage. The flow can be measured at any position in 
the apparatus but must be converted to the conditions existing at the nozzle 
tip. 

2. Measurement of f the Frequency of Bubble Formation 

When the frequency of bubble formation is very low (<200 bubbles per 
minute), the bubbles can be visually counted without the aid of any instrument. 
When the frequency is higher than this, other methods have to be employed. 

The simplest instrument that finds extensive use is the strobotac which 
illuminates the object at varying frequencies. When the frequency of the 
strobotac flash coincides with that of bubble formation, the image of the 
bubble appears to be stationary. This method is evidently useful only when the 
system is transparent. 

Various other methods devised for measuring frequency are all based on 
the fact that the bubble creates a disturbance in a continuously occurring 
phenomenon. This distrubance can be utilized as an obstruction to the passage 
of light, x-rays, or y-rays; or as a disturbance in the noise pattern or in the 
flow of current. Following are some of the methods which make use of the 
above ideas. 

a . Photoelectric Cell Counter. When a light beam passes just above the 
nozzle and through the column on to a photoelectric cell, a voltage which 
depends on the intensity of the emergent beam is developed. If a bubble is 
formed at the nozzle tip it obstructs the light beam in its passage and the 
voltage at the cell drops suddenly. The cyclic formation of bubbles causes 
corresponding fluctuations in the voltage. The rate of voltage fluctuations 
directly gives the value of the frequency of bubble formation. This method 
has been successfully employed by Sullivan et al. (SI7) and Lamont and 
Scott (LI). 

In order to carry out the counting by mechanical devices, the count rate 
is divided by a factor of two or four and the output is amplified. 

b. Cinephotographic Method. High-speed photography can also be used 
for measuring the frequency. This method is particularly useful when direct 
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readings of bubble volume are required. The camera is operated at a known 
speed and the film is projected at a much lower speed. Knowing the camera 
speed, the projector speed, and the number of bubbles per unit time at the 
projected nozzle, the frequency of bubble formation can be directly evaluated. 
This method has been used by Venugopal, Kumar, and Kuloor (V2) during 
their studies in liquid-phase oxidation of acetaldehyde to acetic acid. 

All the above methods which make use of the passage of light through the 
liquid-gas mixture are applicable only when the liquid transmits light. For 
opaque systems like fluidized beds and some non-Newtonian fluids, methods 
based on different principles have to be employed. 

c. Pressure Pulse Technique. When a bubble detaches itself from the 
nozzle tip, it generates a pressure fluctuation which can be picked up by a 
suitably situated microphone. The pulse frequency can be measured in com¬ 
parison with a standard variable-frequency generator. Two groups of in¬ 
vestigators have employed this method for the measurement of bubble 
frequency. 

Calderbank (Cl) employed a crystal microphone located in the gas supply 
line near the nozzle tip, which was connected to an oscilloscope through a 
preamplifier. The photographic comparison of this signal with a constant- 
frequency (60 cps in this case) test signal, yielded the frequency of bubble 
formation. 

Newman and Lerner (N2) have used an arrangement where the signal 
picked up by a microphone attached to the flat surface below the orifice plate 
is amplified and fed to a loud speaker. The amplified bubble signal is then fed 
to one pair of fixed contacts of a double-pole double-throw switch of which 
the other pair of fixed contacts is connected to an audio-frequency generator. 
The movable contacts of the switch are connected to the vertical and ground 
terminals of an oscilloscope. This arrangement permits the observation of 
either the bubble signal or the sine wave as a function of the internal linear 
time-base of the oscilloscope. 

The signal, amplified to a good sound level at the loudspeaker, is fed to the 
vertical input of the oscilloscope. A stationary trace is obtained on the 
oscilloscope. The frequency of the sine wave of the oscilloscope is varied until 
a single trace is obtained. This frequency is then equal to the frequency of 
bubble formation. 

d. Capacitance Technique. The capacitance of a system changes when a 
bubble is formed between its two probes. The frequency of capacitance change 
corresponds to the frequency of bubble formation, 

Calderbank (Cl) in studying gas-liquid contacting on plates, placed one 
platinum probe (~0.6 cm in length) in the path of the bubble stream and 
another in the undisturbed liquid, both probes being connected in series with 
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a 4.5 V dry cell. A scaler with a readily adjustable discrimination was used in 
conjunction with the above arrangement. Each bubble passing the probe pro¬ 
duced a pulse, every sixty-fourth of which was recorded on a magnetic 
counter with interpolation by a scaler. 

This arrangement could be used with distilled water, without the use of 
any electrolyte. 

A similar technique, different in detail, has been employed by Harrison 
and Leung (H3) for studying bubble formation in fluidized beds. 

Other methods based on y-ray absorption or x-ray cinematography may 
be used for special cases by employing the principles already explained. How¬ 
ever, for most of the ordinary contacting devices, the methods enumerated 
above are adequate. 


III. Influence of Various Factors on Bubble Size 
A. Experimental Set-up 

Having determined the bubble size, the next step is to examine the several 
factors that influence it. Before doing this, it is important to bear in mind a 
general set-up of the equipment commonly used in the formation and study 
of bubbles. The bubble studies normally conducted and reported in literature 
are concerned with single nozzles, even though multiple nozzles are commonly 
employed in industry. This is done to facilitate an understanding of the bubble 
formation phenomenon at a single nozzle, which can then be extended to 
multinozzled systems. Although the equipment employed varies from investi¬ 
gator to investigator, a typical set-up is that used by the present authors and 
their colleagues and described below. 

The set-up can be divided into three sections as shown in Fig. 3: the gas 
supply, the bubble forming section, and the gas metering section. 

The supply of gas under constant pressure can be obtained either by the 
constant-level tank arrangement or by a compressor or cylinder through a 
surge tank. If mass transfer is to be avoided during bubble formation, then 
the gas can be passed through a bubbler containing the same liquid as that 
in which the bubble formation is to be studied. 

The bubble-forming section consists of a control valve (needle valve type), 
a chamber, and the liquid column. It is customary to have flat glass plates as 
two sides of the column, to assist photography and visual observation of the 
phenomenon. A manometer is also provided to evaluate the pressure difference 
across the nozzle. 

The bubbles formed break at the surface, where an inverted funnel having 
an outlet at the top collects them and conveys the gas to the measuring devices 
which have already been discussed in Section II. 
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Fig. 3. Equipment for bubble formation under constant flow and constant pressure 
conditions. 

The measuring devices like rotameters, orifice meters, etc. can also be 
introduced into the equipment prior to the bubble forming device. Keeping 
the above general set-up in mind, we can now proceed to examine the factors 
which influence the bubble size. 

B. Factors Influencing Bubble Size 

These factors can be broadly classified as equipment variables, system 
variables, and operating variables. 

1. Equipment Variables 

These variables are directly decided by the bubble forming device, whose 
major parts are the chamber, the nozzle, and the liquid column. The important 
factors are enumerated below: 

a. diameter, geometry, orientation, nature and material of construction 
of the nozzle, 

b . chamber volume, 

c. arrangement of slots for multiholed plate systems. 

Most of these factors remain unstudied, though the influence of the 
diameter of the nozzle and chamber volume have been reported in the 
literature (B3, D4, D5, H7, PI). 
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2. System Variables 

These factors are associated with the liquid-gas combination chosen for 
investigation. The most important system variables are: 

a. surface tension y 

b. density of the liquid p u viscosity of the liquid /q 

c. density of the gas p g and viscosity of the gas p g 

d. contact angle 6 

e. velocity of sound in the gas c. 

The considerable discrepancies in literature regarding the effects of many of 
the above variables will be discussed later. 

3. Operating Variables 

These are the factors which can be varied by the investigator at will. They 
include 

a. volumetric flow rate of the gas 

b. velocity of continuous phase 

c. head of the liquid in the column (“submergence”) 

d. pressure drop across the nozzle 

e. temperature of the system. 

The pressure drop has been included here for want of a better place. The 
temperature of the system does not directly influence the bubble size, but does 
so indirectly by varying the physical properties of the gas-liquid system. 
Hence it can be omitted from the analysis. Even the other variables are not 
entirely independent. For example, the pressure drop across the nozzle is a 
function of the flow-rate, chamber volume, etc. 

Normally, the various factors do not influence the bubble size to the same 
extent over the entire range of the other variables. Thus surface tension, which 
is one of the most important factors determining the bubble size at vanishingly 
small flow rates, assumes much less importance at higher flow rates. Similarly, 
the viscosity of the liquid is much more important at higher flow rates than 
at the lower ones. 

The known information regarding the influence of various factors is given 
below. Most of the investigators have tried to study the effect of one variable 
at a time, but often minor variations of other variables also occur alongside. 
For example, variation of viscosity by the use of glycerol solution varies the 
surface tension by a few dynes per centimeter. 

C. Effect of Orifice Characteristics 

The observations made below are applicable mainly to wetted nozzles. 
At low gas flow rates, the effect of orifice diameter is such that the volume 
of the bubble is directly proportional to it (B3, D4, VI). At higher flow rates 



268 


R. KUMAR AND N. R. KULOOR 


also the bubble volume has been reported to be a strong function of the 
orifice diameter. Thus Leibson et al. (L2) and Davidson and Amick (D5) find 
that the bubble volume is proportional to the 3/2 power of the orifice diameter, 
whereas Quigley et al. (Ql) find the power to be 2.01. The bubble size is found 
to be independent of the orifice diameter by Silberman (SI2) in a study of 
bubble formation in jets. In the region of single bubble formation, the data 
of Ramakrishnan, Kumar, and Kuloor (Rl) show that the bubble volumes 
for various nozzle diameters tend to join in a single curve at high flow rates. 

Apart from the size of the orifice, the nature of the orifice is of consider¬ 
able importance. Thus the use of an orifice and a tube of the same diameter 
yields different bubble sizes. The orifice plate is found to yield a higher bubble 
volume than the corresponding tube. 

The thickness of orifice plate may also affect the bubble size. This effect 
has been reported to become significant (H4) if the thickness is equal to or 
greater than 100 times the value of orifice diameter. A similar observation 
has also been made by Hughes et al. (H7). 

One of the major variables associated with the orifice is the geometry of 
the opening. Though most of the investigators have used circular orifices, 
slots of various shapes are used in the industry. In bubble-cap columns, the 
slots are not only rectangular but also vertically oriented and for gas-liquid 
contacting, porous unglazed cylinders are used, where the pores are generally 
not circular .Thus the slot geometry is seen to be an important factor that 
requires detailed study. A systematic approach to this has been made by 
Krishnamurthi, Kumar, and Datta (K5, K7). They have compared the bubble 
volume obtained by using alternatively, a standard circular orifice of arbitrary 
diameter and two sets of orifices of other geometries like triangular, square, 
etc., chosen to have either their perimeters or their areas equal to that of the 
standard one. Their work, which was confined to low flow rates (<0.05 
cm 3 /sec), indicated that the bubble volumes obtained from the circular orifice 
did not correspond exactly to those from the noncircular orifices whether 
compared on an equal-perimeter or equal-area basis. They were closest for 
orifices of equal area. On extending this work to higher flow rates (up to 
200 cm 3 /sec), Ramakrishnan, Kumar, and Kuloor (Rl) found that an orifice 
having a noncircular geometry gives bubble volumes equal to those obtained 
from a circular orifice of the same area. Thus any theory for the formation of 
bubbles from circular orifices can also be extended to situations where non¬ 
circular orifices are involved. 

D. Effect of Chamber Volume 

Spells and Bakowski (SI 5) were the first to recognize the importance of 
the chamber volume as a variable. They observed that the inclusion of a 
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35 litre reservoir between the air supply and the bubble slot influenced the 
bubble formation phenomenon. However they did not come to any definite 
conclusions and remarked that their findings needed further confirmation. 

This was done by Hughes et al (H7). They found that for small chamber 
volumes and small flow rates, the bubble volume is virtually independent of 
the chamber volume. Similar is the case when the chamber volume is large 
and the flow rates are normal. At very small flow rates and large chamber 
volumes, the bubbles normally form in doublets, and triplets and their size 
cannot be definitely determined. These conclusions have been verified by 
Davidson and Amick (D5) who varied the chamber volume in their equipment 
from 4 cm 3 to 4000 cm 3 . 

The chamber volume can be varied by varying either the diameter or the 
height of the chamber. It is of interest to know whether the geometrical pro¬ 
portions of the chamber have any influence on the bubble volumes obtained. 
Hayes et al. (H4) studied this aspect of the problem and found that the effect 
of the diameter of the gas chamber is insignificant in the formation of the 
bubbles, provided that the ratio of internal diameter of the gas chamber to 
that of the orifice is equal to or greater than 4.5. When the diameter of the gas 
chamber approaches that of the orifice, the bubble formation occurs as if at 
zero gas chamber volume. The bubble formation is further found to be in¬ 
dependent of the chamber volume, provided that the latter is larger than 
about 800 cm 3 . 

When the chamber volume is between 0 and 800 cm 3 , resonance effects 
are important because then the velocity of sound plays an important role. 
Very little work has been done regarding the resonance effects. 

The influence of chamber volume is closely related to that of the pressure 
difference, AP, across the orifice. It should, however, be defined as the volume 
between the orifice and that point in the gas stream where the pressure drop 
is very large. Normally, this point occurs at the control valve, but does not 
necessarily have to. Davidson and Schuler (D8) have used a sintered plate 
just before the orifice, even though a large gas chamber exists below it. 
Similarly, Krishnamurthi, Kumar, and Kuloor (K9) have obtained high pres¬ 
sure drops near the orifice by packing the chamber with fine glass powder. 
When the pressure drop across the orifice is large and the pressure variations 
occurring during the formation of bubble are small compared to the total 
pressure drop, the gas flow rate does not change during bubble formation and 
the bubbles are said to form under constant flow conditions. This is true also 
when the chamber volume is zero. Thus, under constant flow conditions, zero 
chamber volume corresponds to a high pressure drop across the orifice. 

Similarly, if conditions are such that on the air-supply side of the orifice 
the pressure is maintained constant during bubble formation, as the bubble 
size increases, the pressure inside it decreases, resulting in a higher flow rate 
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to the bubble in the constant pressure arrangement. Therefore, constant flow 
conditions are not maintained and the rate of bubble-growth varies. These are 
the conditions which prevail when large chamber volumes (>800 cm 3 ) 
prevail. 

Davidson and Schuler (D8) observed the above two situations of constant 
flow and constant pressure on the basis of pressure drop. In industry, constant 
pressure conditions are normally employed. 

E. Effect of Submergence 

Most of the investigators studying the formation of bubbles at horizontal 
orifices have chosen the orifice submergence as one of the variables (D4, D5, 
H4, H7, SI5). It is generally agreed that this variable does not influence the 
bubble volume at the tip. This fact has been verified by the work of Datta 
et al (D4) in a study of bubble formation from capillary tips (under essentially 
constant flow conditions), where the hydrostatic head was varied from 213.3 
cm to 60.95 cm for an orifice diameter of 0.141 cm and a frequency range of 
1-125 bubbles per minute. The negligible influence of submergence on the 
bubble volume has also been reported by Hughes et al (H7), who varied the 
submergence from 20.07 cm to 2.032 cm for orifice diameters of 0.107 cm and 
0.1864 cm; and by Davidson and Amick (D5), who varied the submergence 
from 25.4 cm to 2.54 cm. The region of submergence where the bubble 
volume is affected is when the submergence is low (SI5) and less than ap¬ 
proximately twice the bubble diameter (H4). 

In apparent contradiction to these results are the data reported by Pad- 
mavathy, Kumar, and Kuloor (PI), who varied the submergence from 163 cm 
to 66 cm using an orifice diameter of 0.29 cm and found that the bubble 
volume changed by more than 2.5 times. Figure 4 shows a graph plotted 
from the data of the above authors where the bubble volume is seen to de¬ 
crease with increasing submergence. The discrepancy is mainly due to the 
fact that all the earlier investigators had employed either constant flow or 
constant pressure conditions, whereas Padmavathy, Kumar, and Kuloor (PI) 
worked in the intermediate region by having a chamber volume of 60 cm 3 . 
Though Hughes et al. (H7) also had a chamber volume of 60.63 cm 3 , the 
maximum submergence studied by them was of the order of 20 cm 3 . It is seen 
from the graph that at higher values of submergence the bubble volume is 
very sensitive to variations in submergence. At the chamber volume of 66 cm 3 
itself the bubble volume hardly varies with submergence, and at still lower 
values of the chamber volume, the effect of submergence is negligible. Thus 
the study of Hughes et al (H7) was conducted in too low a range of submer¬ 
gence to detect any appreciable change in the bubble volume. 
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Fig. 4. Effect of orifice submergence on bubble volume in intermediate region. 


The submergence, therefore, has no influence on the bubble volume under 
constant flow and constant pressure conditions, although it does affect the 
bubble volume appreciably in the intermediate region. 


F. Effect of Surface Tension of the Liquid and the 
Wetting Properties of the Orifice 

Tate’s law makes use of bubble formation in the measurement of the 
surface tension of a liquid, and hence the fact that surface tension influences 
the bubble volume is obvious. At flow rates tending to zero, the bubble 
volume is such that the upward force due to buoyancy is balanced by the 
downward force of surface tension. So, an increase in surface tension should 
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result in an increased bubble volume. This is borne out by the investigations 
of Datta, Napier, and Newitt (D4) and others (B3, C8, H7). All the above 
authors studied bubble formation at low gas-flow rates and under nearly 
constant flow conditions. 

Davidson and Schuler (D8), however, find that under the conditions of 
constant flow, surface tension has no effect on the bubble volume. Similar 
results are reported by Siemes and Kaufmann (SI 1). Kumar and Kuloor 
(K16) predicted that the surface tension effects are large at low flow rates and 
continuously diminish as the flow rate is increased. 

The conclusions have been verified by Ramakrishnan, Kumar, and Kuloor 
(Rl). The results obtained from two liquids of surface tension values 72 and 
41 dynes per centimeter are shown in Fig. 5. The values of bubble volume in 
the two liquids are seen to be different at low flow rates but merge with each 
other at higher flow rates, indicating that the contribution of surface tension 
to the bubble volume is negligible at higher flow rates. 

At constant pressure conditions, Quigley, Johnson, and Harris (Ql) find 
that for higher flow rates, the effect of surface tension on bubble volume is 
negligible. These authors may not have adequately accounted for the large 
difference in the densities of the two liquids—water and carbon tetrachloride 
—used by them. Davidson and Schuler find that under constant pressure con¬ 
ditions, surface tension does appreciably affect the bubble volume. 

The work currently being conducted by Satyanarayan, Kumar, and Kuloor 
(S3) indicates that the effect of surface tension is more involved than hitherto 
appreciated. Some of their data are presented in Figs. 6. and 7. They find that 
at very small orifice diameters or at very large flow rates, the surface tension 
variation has negligible influence on the bubble volume. For higher orifice 
diameters, the influence is more pronounced at small flow rates, as is evident 
from Fig. 7. 

Thus, generally stating, for both constant-flow and constant-pressure con¬ 
ditions, surface tension affects the bubble volume at low flow rates and has 
negligible influence at high flow rates. This statement is again an over¬ 
simplification, because surface tension effects are less evident for highly 
viscous liquids than for less viscous liquids. 

G. Effect of Liquid Viscosity 

A considerable amount of contradiction exists regarding the influence of 
this variable. Schurmann (S6), while studying bubble formation in various 
liquids from porous earthenware, concluded that viscosity is the principal 
factor which determines the bubble volume. Similar conclusions have been 
drawn by Davidson and Schuler (D8), who find that an increase in viscosity 
causes a marked increase in the bubble size. 
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In glaring contradiction to the above results are the findings of Datta et al 
(D4). These investigators used a number of aqueous glycerine solutions having 
a wide range of viscosities and found that for orifice diameters of 0.036 to 
0.63 cm, a hundredfold increase in viscosity caused a decrease in bubble 
volume of about 10%. All the above conclusions were arrived at under con¬ 
stant flow conditions, although they are equally applicable to constant pres¬ 
sure conditions also. 

The results reported by Quigley et al (Ql) and Coppock and Meiklejohn 
(C8) are in between the above two extremities. The former authors report 
that the bubble volume increases slowly with viscosity, whereas the latter find 
the viscosity to be ineffective. 

The above results seem to be contradictory and irreconcilable. This is not 
so because the effect of viscosity is associated with those of flow rate, surface 
tension, and orifice diameter. Since the effect of viscosity is negligible when 
the flow rate tends to zero, even a large difference in the viscosities of the two 
fluids under consideration does not, at small flow rates, show the influence 
of viscosity. This is precisely the case in the investigations of Datta et al (D4). 
What was mistakenly interpreted as the influence of viscosity, was probably, 
in reality, the influence of the reduced surface tension: though the viscosity 
had been increased a hundredfold, the surface tension was simultaneously 
reduced by about 5 dynes per centimeter. At the extremely small flow rates 
(<0.1 cm 3 /sec) employed, the effect of viscosity was presumably negligible. 

Schurmann (S6) and Davidson and Schuler (D8) used extremely small 
orifices, so that the downward surface-tension force was negligible, and vis¬ 
cosity was the controlling factor in bubble formation Hence these investigators 
concluded that viscosity has a great influence on the bubble formation. 

All the above conclusions and the data reported will be discussed in detail 
later, when a general model is proposed for bubble formation under constant 
flow conditions. It will then be shown that each of the above observations is 
correct although the conclusions drawn from them are applicable only in 
limited range. 


H. Effect of Liquid Density 

When the density of a liquid is increased, the buoyancy force correspond¬ 
ing to a specific size of the bubble increases, whereas the surface-tension force 
may remain constant. Thus, for a definite amount of surface-tension force, 
the bubble volume obtained is smaller. 

Benzing and Myers (B3), Coppock and Meiklejohn (C8), and Davidson 
and Schuler (D8) agree that the bubble volume decreases with increasing den¬ 
sity of the liquid. Quigley et al (Ql) disagree with the above and observe that 
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Fig. 5. Effect of surface tension on bubble volume under constant flow conditions. 

the liquid density has no effect on the final bubble volume. This discrepancy 
can be explained as follows: 

The increased density of a liquid does increase the buoyancy at all levels 
of bubble formation but its effect is most pronounced for liquids of high vis¬ 
cosity and at low flow rates. A downward force in addition to that of surface 
tension is created by the bubbles which expand or move, because a certain 
amount of the liquid is carried along with them. Although the volume of the 
liquid being carried along is independent of the liquid density, the downward 
force increases with increasing density. This net effect of an increase in the 
downward force is in direct opposition to what takes place at zero flow rates. 
The first set of investigators mentioned above used either very small flow rates 
or very high viscosities. Under these conditions, the increased downward 
force resulted in their observing a decrease in the bubble volume with in¬ 
creasing density. On the other hand, Quigley et al (Ql), who employed high 
flow rates and low viscosities, were led to observe the noninfluence of density 
on the bubble volume. 

I. Effect of Gas Properties 

There has been no systematic study of the influence of gas properties. 
The gas density appears in the analysis either in the virtual mass term or in 
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Fig. 6. Effect of surface tension on bubble volume for small orifice diameters under 
constant pressure conditions. 





Volumetric flow rate (cm^/sec) 

Fig. 7. Effect of surface tension on bubble volume for large orifice diameters under 
constant pressure conditions. 
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the buoyancy term. It is generally omitted from the analysis on the assumption 
that it is negligible when compared with the liquid density (i.e., p t - p g & p t ; 
P g +T*Pl 

The viscosity of the gas has again a negligible effect on the bubble volume. 
Hence, the gas properties can reasonably be considered to be ineffective. 

J. Effect of Gas-Flow Rates 

In the initial stages of bubble formation studies the frequency of bubble 
formation was considered as an important parameter. Later on, the frequency 
was observed to be decided by the flow rate and the bubble volume. Though 
the flow rate could be controlled independently, the frequency could not be 
controlled. More important is the fact that when flow rate is increased con¬ 
tinuously for a particular system, a stage is reached when the frequency of 
bubble formation remains essentially constant, whereas the bubble volume 
changes. Thus the bubble volume can assume an infinite number of values 
for a constant frequency. On the other hand, the bubble volume is a single¬ 
valued function of the flow rate provided all the other conditions are main¬ 
tained constant. Thus, for purposes of developing a correlation, it is more 
reasonable to employ the gas-flow rate, which is an independent variable, 
rather than the frequency, which is a dependent variable. Hence the wide 
range of gas flow rate as an operating variable. 

There exists a general agreement on the pattern of bubble-volume varia¬ 
tion when the flow rate is increased. As the flow rate is gradually increased 
from zero, the bubble volume remains fairly independent of the flow rate, 
whereas the frequency increases. The only exceptions to the above observa¬ 
tion are provided by the results of Maier (M2), Datta et al. (D4), and 
Krishnamurthi, Kumar, and Kuloor (K8). They find that the bubble volume 
first decreases with increasing flow rate, passes through a minimum, and then 
starts increasing. This behavior is restricted to capillaries when inviscid 
liquids like water are used. They also find that with increasing viscosity the 
extent of decrease in bubble volume becomes increasingly less until, for very 
highly viscous liquids, there is no decrease at all. 

Suspecting that constant-flow conditions are not realized even for capil¬ 
laries and that the effect is due most probably to the chamber volume instead 
of kinetic effects as explained by the earlier investigators, data were col¬ 
lected using empty capillaries and capillaries filled with glass powder. The 
results show that for glass-powder-filled capillaries, the minimum is absent 
(K8). 

Further, the constancy of bubble volume with flow rate at small flow rates 
which is observed by many investigators is true only for inviscid liquids having 
high surface-tension effects. If highly viscous liquids (p > 500 cp) are used, 
the bubble volume increases very fast with the flow rate. On increasing the 
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flow rate, at first both the bubble volume and the frequency increase, but later 
on a stage is reached when the frequency remains essentially constant whereas 
the bubble volume continues to increase. Though these regions are observed 
for all the systems studied, the conditions under which one region ends and 
the other begins are not clear. There has been no systematic investigation of 
this aspect of bubble formation, though Davidson and Amick (D5) find that 
the flow rate at the transition to constant-frequency bubbling increases 
rapidly with orifice diameter. 

K. Effect of Continuous-Phase Velocity 

Maier (M2) has given quantitative data showing that the continuous-phase 
velocity results in a reduction in bubble size. During a study of bubble for¬ 
mation from vertical nozzles, Krishnamurthy et al (K13) observed a decrease 
in the bubble volume resulting from an increase in buoyancy caused by the 
continuous-phase velocity. These authors developed equations based on 
drag considerations which can predict the bubble volume when the con¬ 
tinuous phase has a velocity. But, in their study, the continuous-phase velocity 
is so directed as to decrease the bubble volume, and hence the results cannot 
be generalized. 

If, however, the velocity tends to decrease the buoyancy (as in downward 
flows), there is every likelihood that the final bubble volume may be larger 
than that obtained without the continuous-phase velocity. 

L. Conclusions 

In general, it can be said that there exists a considerable amount of dis¬ 
crepancy in literature regarding the influence of various factors mentioned 
above, on the bubble volume. These discrepancies are listed in Table I for 
easy reference. 

Most of the above discrepancies have arisen because of the lack of 
appreciation by various investigators of the effects of the chamber volume, 
the types of flow associated with it, and of the interaction of the several 
variables considered, such as the viscosity, surface tension, and flow rate. 
A proper understanding of such interactions is essential if the discrepancies 
listed in Table I are to be completely explained. 


IV. Bubble Formation under Constant Flow Conditions 


A general model applicable to the formation of bubbles under all kinds 
of operating conditions is not yet available. The complex nature of the prob¬ 
lem may very well delay the emergence of such a theory. Even under constant 
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TABLE I 

The Influence of Liquid Properties on Bubble Volume as Reported by 
Various Investigators 0 



Investigators 

Viscosity 

Reported effect of: 
Surface 
tension 

Density 

I. 

Datta et ah (D4) 

Negative 

(small) 

Positive 

— 

2. 

Quigley et al (Ql) 

Positive 

(small) 

None 

None 

3. 

Coppock and Meiklejohn (C8) 

None 

Positive 

Negative 

4. 

Davidson and Schuler (D8) 

Positive 

(large) 

None— 
constant flow 
Positive— 
constant pressure 

Negative 

5. 

Benzing and Myers (B3) 

None 

Positive 

Negative 

6. 

Siemes and Kaufmann (SU) 

Positive 

(large) 

None 

None 


0 “Negative'" means the bubble volume decreases with increase in the value of the 
property. 


flow conditions, the theories available in literature are restricted to special 
situations only, which are discussed below. 

A. Bubble Formation in Inviscid Liquids 

Some investigators have excluded the surface-tension force from the 
analysis of bubble formation since its effects have been found to be negligible 
at high flow rates. 

1. Bubble Formation in Inviscid Liquids Neglecting Surface Tension Effects 

The omission of the surface-tension force reduces the problem to a hydro- 
dynamical one. Two models are available for this situation. 

a . Davidson and Schuler's Model (D9). The bubble is assumed to be form¬ 
ing at a point source where the gas is supplied. As the bubble is formed, it 
moves upward with a velocity which is decided by the various forces acting 
on it. Detachment is considered to take place when the center of the bubble 
has covered a distance equal to the sum of the radius of the bubble and the 
radius of the orifice. For small orifice diameters, the extra distance of the 
orifice radius is neglected, resulting in a considerable simplification of the 
analysis. An idealized sequence of bubble formation is shown in Fig. 8. 

At each stage of bubble formation, the bubble is assumed to be spherical. 
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(a) (b) (c) 

Fig. 8. Idealized sequence of bubble formation in a liquid—Davidson and Schuler’s 
model (D8). 

For evaluating the upward movement of the bubble during its formation, the 
upward force considered is that of buoyancy, which can be written as Vp { g 
when p g <$ p t . Since the bubble is surrounded by the liquid, a part of the 
liquid is carried along with the bubble. For an irrotational flow, as in the 
present case when a sphere is moving away from a wall, the volume of the 
liquid carried along with a bubble is 11/16 of the volume of the sphere (M4). 
Thus, the rate of change of momentum of the bubble due to buoyancy is 
(dldt)[{\\j\6)Vp { v\ On writing the velocity of the bubble v as dsjdt , the final 
equation of motion for the bubble becomes 

Vp l 9 = (d/dm\/16)Vp l (dsldtn (1) 

As the flow remains constant throughout the bubble formation, the 
volume of the bubble at any time can be written as 

V=Qt (2) 

Equation (2) is substituted in Eq. (1) and the latter integrated by using the 
following boundary conditions: 

At t = 0, {dsjdt) = 0 and s = 0 
Thus, 

s-4gt 2 /\l (3) 

At detachment, i.e., at t = t c 

S = r F = (3Qt c /4n) il3 (4) 

Substituting Eq. (4) in Eq. (3), we have 

t c = (ll/4<7) 3/5 (3e/470 1/5 (5) 

The final bubble volume V F is given by 

V F — Qt c = 1.378(£) 6/5 /# 3/5 ) (6) 

This equation closely resembles the empirical expression of Van Krevelen 
and Hoftijzer (VI) for the bubble formation in inviscid liquids, provided that 
the gas density is negligible compared to the liquid density. Their relationship 
is 


V F = 1.722 (Q 6l5 lg V5 ) 


( 7 ) 
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Davidson and Schuler (D9) have given another equation which takes into 
consideration the residual bubble that forms the nucleus of the succeeding 
bubble. If this residual volume is V 0 , then instead of writing V = Qt, we have 
to write 

V=Qt + Vo (8) 


Proceeding as before, we obtain 


s 


16 # 

11 


4 


2 Q 2 Q 2 



(9) 


The bubble is now assumed to detach when its center has covered a dis¬ 
tance equal to the sum of the radius of the final bubble and the radius of the 
orifice. If the radius of the orifice is R and if it is assumed that V 0 = (4;r/? 0 3 /3), 
then the time of bubble formation can be obtained by plotting on the same 
axes Eq. (9) and (r + R) as a function of time from Eq. (8). 

The quoted authors (D9) collected data on bubble volumes in water, 
aqueous glycerol, and petroleum ether. They have used Eq. (6) for verifying 
bubble volumes obtained for flow rates up to 3 cm 3 /sec. They find that 
theory and experiment agree excellently only in the flow range of 1.5 to 
3.0cm 3 /sec and not below 1.5 cm 3 /sec. This discrepancy has been qualitatively 
explained by them on the basis of surface tension effects, but there is no 
quantitative explanation. Although the equation has not been verified from 
3 to 15 cm 3 /sec, the authors feel that it would be applicable. Beyond 20 cm 3 /sec, 
the experimental values have been compared with those obtained by using 
Eqs. (8)—(9) and considerable deviation has been observed. 

The deviation of the experimental values from the theoretical ones has 
been explained as being due partly to the upward current induced by the 
bubbles in the liquid surrounding the orifice and partly to the distortion in the 
spherical shape of the forming bubbles. 

Davidson and Harrison (D6) have brought about a change in Eq. (1) by 
using a value of 1/2 in the place of 11/16 for the virtual mass term. Proceeding 
in exactly the same manner as earlier, they obtained 

n-=1.138(G 6/ V /5 ) (10) 

Equation (10) has been tested for the data of a large number of investi¬ 
gators. The data are plotted in Fig. 9 along with the Davidson and Harrison 
line (D6). The agreement is good enough to obtain approximate values, 
though in most of the cases the theoretically predicted volumes are smaller 
than those obtained experimentally. 

ft. Kumar and Kuloor's Model (XI8). In this model, the bubble formation 
is assumed to take place in two stages, viz. the expansion stage and the 
detachment stage. The bubble is assumed to stay at the orifice in the first 
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Fig. 9. Data on bubble formation in inviscid liquids. 


stage, whereas in the second stage it is assumed to travel away from the tip 
until it detaches itself. An idealized picture of the bubble formation according 
to this model is shown in Fig. 10. Figure 10a shows the expansion stage 
where the bubble expands while its base remains fixed. This stage stops as 
soon as the bubble base starts moving away from the nozzle tip. This marks 
the beginning of the second stage, which is terminated completely when the 
bubble detaches from the neck. During the second stage, the bubble not only 
moves away from the gas source, but also goes on expanding due to the 
incoming gas. 





Expansion stage Detachment stage Condition of detachment 

Fig. 10. Idealized sequence of bubble formation—Kumar and Kuloor’s model (K18). 

These two stages can also be anticipated from theoretical considerations. 
When the flow rates are vanishingly small, the surface tension is the main 
downward force and buoyancy the upward one. The bubble base must remain 
fixed to the nozzle tip until the buoyancy force exceeds the surface tension 
force. Apart from this, there is a drag force due to the expansion rate because 
an expanding bubble is equivalent to a moving bubble (H4). When the orifice 
diameter tends to zero and the surface tension force is negligible, the resisting 
force is due solely to the drag of the expanding bubble. This force is large 
even for small flow rates because the bubble grows from a point source where 
its rate of expansion is very large. As the bubble grows, the drag due to 
expansion goes on decreasing, whereas the buoyancy force increases con¬ 
tinuously. The first stage comes to an end when the buoyancy force equals 
the downward forces. 

The end of the expansion stage marks the beginning of the detachment 
stage. Now, the buoyancy force is higher than the downward resisting forces. 
The other upward force resulting from the kinetic energy of the gas is so small 
as to be negligible (D8). The main aspect in this stage consists of the motion 
of an expanding bubble and can be quantitatively expressed through Newton's 
law of motion. From this, the time of detachment t c is evaluated as the time 
required by the bubble-base to cover a distance equal to the radius of the 
bubble from the first stage. On multiplying t c with the volumetric flow rate, 
the extra volume entering the bubble during the second stage is obtained. 

The final volume V F can be evaluated by finding individually the value of 
the force-balance bubble volume Vf> and the volume Qt c entering the bubble 
during detachment, and then adding the two. Thus, 

V F = V fb + Qt c (11) 

(i) Evaluation of V Jb : the force-balance bubble volume is determined by 
making use of the equality of the upward buoyancy force with the downward 
drag force due to expansion. 

The upward buoyancy force = V(p t - p g )g 


( 12 ) 
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The downward expansion force can arise from two sources, viz. the drag and 
the rate of change of momentum of the bubble and a part of the surrounding 
liquid. For an irrotational flow, the drag force on a bubble forming in an 
inviscid fluid is zero. The drag force is present if there is a fully developed 
wake behind the forming bubble. The forming bubble, however, is similar to 
a sphere being accelerated from rest, as pointed out by Davidson and Schuler 
(D9). For such a situation, the flow is essentially irrotational and the wake 
is not established until the body has covered an appreciable distance (Gl, P4). 

The other force resulting from expansion can be computed from the rate 
of change of momentum of the forming bubble, i.e., ( d/dt e )(Mv e ). Here M is 
the virtual mass of the bubble and is the sum of the mass of the gas in the 
bubble and that of 11/16’ths of its volume of the liquid surrounding it, and 
v e is the velocity of expansion of the bubble. Thus, 

M=[p g + (n/l6) Pl ]Qt e (13) 

or if p g <p u 

M = (11/16)p, Qt e (14) 


The upper part of the bubble moves with a velocity equal to the rate of change 
of its diameter, whereas its base remains stationary. Thus, the average 
velocity of the bubble can be assumed to be that with which the center of the 
bubble is ascending, which is equal to the rate of change of the bubble radius: 


Further, 


dr _ Q _ QV ~ 2/3 

dt e 4 nr 2 47i(3/4^) 2/3 


d x _, dv e 


dM 

dt e 


(15) 


(16) 


The values of dv e /dt e and dM/dt e found by the differentiation of Eqs. (13) 
and (15), respectively, are 


and 


dMldt e = [p g + (U/l6) Pl ]Q (17) 


dVe_ _ e 2 K ~ s/3 
dt e 6n(3/4n) 213 


(18) 


Substituting various terms in Eq. (16) and simplifying, we obtain 



6 2 li>, + (ii/i6)p.]K- 2/3 
127i(3/47t) 2/3 


( 19 ) 
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At the end of the expansion stage we have 

V (0 _ Mfl . 6 a D>, + (n/i6)p,]^ 

V fb\Pl Pg/9 i>waM_\2/3 

Solving for V fb , we obtain 


- 2/3 


127t(3/4t0 2/3 




G 2 [p, + (ll/16)p,] 


I2n(3l4n) 2, \p t - p g )g 


3/5 


For p.<p u 


v fb = 


ii e 2 


192n(3/4^) 2/3 gJ 


3/5 


( 20 ) 


( 21 ) 


( 22 ) 


(ii) Evaluation of the time of detachment, Equating the rate of change 
of momentum of the bubble to the net force, we obtain 


(djdt)(Mv') = (V fb + Qt)( Pl - p g )g (23) 

Here v\ the velocity of the center of the bubble, is made up of two parts, viz. 
the velocity of the bubble base due to movement and the velocity of the 
center due to expansion. Thus, 

tf = v + dr/dt (24) 


Substituting Eq. (24) in Eq. (23) and simplifying, we obtain 


M Jt + v ~!i7 = ( Vfb + ® l ^ Pl ~ Pa)9 ~ 


8 2 [P g + (H/16)P|](F /t + Qty 2 / 3 


127r(3/47r) 2/3 


(25) 


The equation of motion is more useful in the above form when expressed 
in terms of v , the velocity of the base, because the condition of detachment 
pertains to the bubble base and not to the bubble center. Equation (25) can 
be written as 


dvjdT+v\T=P- AT“ 5/3 


(26) 


where 


iPl - Pg)9 
QlPg + (1 l/16)p / ] 


127t(3/4ti) 2/3 


and 


T = (V fb + Qt) 
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Equation (26), when solved using the boundary conditions v = 0 at t = 0 and 
T = V fb at t = 0, yields 


PT 

v = tJ-- 3NT -2/ 3 + 
2 


3N(F}i 3 )--(F^) 


T 


(27) 


Expressing v as dx/dt and solving the resulting equation for a variation of 
bubble volume from V fb to V F when x changes from zero to r fb , we obtain 

r fb = ~ (V F 2 - V}„) - ^ (K' /3 - V'fl 3 ) 

+ ^ (3JV Vfl 3 ~ \ ^/ft)(ln F f - In F /6 ) (28) 

Equation (28) permits a direct evaluation of the final bubble volume, V F , 
without first calculating the time of detachment. Calculations made on the 
basis of the above equation are presented in the form of a curve in Fig. 11 
using Davidson and Schuler’s (D9) data. 



Flow rate (crr^/sec ) 


Fig. 11. Comparison of Kumar and Kuloor’s model (K18) with the data and model of 
Davidson and Schuler (D9) for bubble formation in inviscid liquids. 
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The authors have collected data for flow rates up to 15 cm/ 3 sec using 
water as the liquid, and have found the model to be applicable. Figure 9 
shows a comparison made between the theoretical values of Kumar and 
Kuloor (K18), the data obtained at higher flow rates by various investigators, 
and the Davidson and Harrison (D6) equation. 

Equation (28) can be simplified at the cost of a certain amount of accuracy. 
The second term on its right-hand side denotes the difference in the radii of 
the final bubble and the force-balance bubble, and its value is normally small. 
Similarly, the third term on the right-hand side is a multiplication of two small 
numbers and is hence very small. As these two terms have algebraically oppo¬ 
site signs, their difference can be neglected when compared with the first 
term on the right-hand side. With all these simplifications and by expressing 
r fb in terms of V fb , Eq. (28) reduces to 

V F - 0.976«2 6/ V /5 ) (29) 

This simplified equation is of the same form as the equations of Van 
Krevelen and Hoftijzer (VI) for air-water system and the theoretical equation 
of Davidson and Schuler (D9) for inviscid liquids. It is interesting to observe 
that the various equations differ only in the value of the constant although 
they are based on different mechanisms. 

To indicate the error introduced by the simplification of Eq. (28), the 
values of bubble volume at different flow rates calculated by both Eqs. (28) 
and (29) have been presented in Table II. The simplification is seen to give 
rise to errors of 10 to 30%, hence restricting the use of Eq. (29) to cases where 
only approximate values are needed. 


2. Bubble Formation in Inviscid Liquids with Surface Tension 


At vanishingly small flow rates, the bubble volume is decided mainly by 
surface tension. Thus, 

^CO.0 
(Pi - Pg)Q 

At very large flow rates, surface tension effects, though present, have a 
relatively smaller contribution. Between these two extremes is a region of flow 
rates where neither the flow rate nor the surface tension can be neglected and 
the final bubble volume is highly sensitive to both. 

Kumar and Kuloor (K16) have extended their model to include surface 
tension. Here again, evaluation of the bubble volume comprises calculation 
of V fb and t c . V fb is evaluated as before, but also taking the surface tension 
force nDy cos 9 into account. Thus, the final equation is 


V fh - 


llg 2 


_ T /~ 2 / 3 — 

fb 1927i(3/4rc) 2/3 0 fb 


nDy cos 9 
Pi 9 


(31) 
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TABLE II 

Values of Bubble Volume Calculated by Rigorous [Eq. (28)] and 
Simplified [Eq. (29)] Equations for Inviscid Liquids without 
Surface-Tension Effect 


No. Volumetric flow rate 
(cm 3 /sec) 

Bubble volume calculated 
from the 

(i) (ii) 

Simplified Rigorous 

equation equation 

Ratio = (i)/(ii) 

1 

1.0 

0.0156 

0.0181 

0.8625 

2 

2.5 

0.0470 

0.0568 

0.8270 

3 

10.0 

0.2482 

0.3650 

0.6810 

4 

15.0 

0.4037 

0.5650 

0.7140 

5 

20.0 

0.5702 

0.7625 

0.7480 

6 

25.0 

0.7452 

0.9050 

0.8232 

7 

35.0 

1.1160 

1.4700 

0.7890 

8 

100.0 

3.9300 

4.9000 

0.8020 

9 

200.0 

9.0360 

9.8500 

0.9160 

10 

1000.0 

62.3400 

73.1000 

0.8512 

11 

10000.0 

988.1000 

1150.0000 

0.8510 


When Q tends to zero, Eq. (31) simplifies to Eq. (30). If the surface-tension 
effect is negligible, then the right-hand side of Eq. (30) vanishes and Eq. (31) 
reduces to Eq. (22), which is applicable to inviscid liquids without surface 
tension. 

In order to calculate the final bubble volume, we again write the equation 
of motion of the bubble center which by, inclusion of the surface-tension 
effect becomes 

M{dv'\dt) + v'(dM/dt) = ( V fb + Qt)Apg — nDy cos 8 (32) 

where A p = (p t — p g ). The following substitutions are now made: 

( Pl - Pg)9 =p 
<2[P 9 + (11/16)P,] 


12tc(3/4t0 2/3 
nDy cos 9 

etp,+( u/i6)p,] 
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The final equation for the evaluation of V F becomes 
r fb = {q(V 2 f ~ V 2 fb ) - - Vy 2 ) V fb ) 

+ £ [(• JV fb + 3 Nvyy - F - F^](ln V F - In V /b ) (33) 

This equation has been verified (K16) by using two orifices of diameters 
0.05 cm and 0.59 cm. The values of Dy cos 9 for the two are 3.6 and 31.9, 
respectively. The data for the 0.05 cm diameter orifice can be adequately ex¬ 
pressed by Eq. (28). It is necessary to take into account the surface tension 
for the latter case through Eq. (33). 

Though the effects of surface tension are present throughout the range of 
flow rates, their contribution to the bubble volume decreases as the flow rate 
increases. This is evident from Fig. 12 wherein for different orifice diameters 
data, with and without surface tension effects, are plotted for the air-water 
system. The dotted lines indicate that the surface tension has been taken into 
account, whereas the full lines indicate the omission of surface tension from 
the analysis. In each case, the dotted line is seen to tend towards the full line, 
thus showing that for large flow rates the surface tension need not be con¬ 
sidered in the evaluation of bubble volume. Further, for inviscid liquids the 
region where the dotted line approaches the full line depends on both the 
surface tension and the orifice diameter. The higher the value of the product 
of the two, the higher is the flow rate at which the surface tension effects 
become negligible, as can be seen in Fig. 12, 

Thus, we conclude that the surface-tension effects can be neglected only 
at higher flow rates and not at lower ones. The error caused by neglect at low 
flow rates can be quite large, its magnitude depending on the orifice diameter. 
Equation (33) can be used for inviscid liquids both in the presence and 
absence of surface-tension effects. 

The use of Eq. (33) as it stands, involves trial and error method and is 
time-consuming. At the cost of some accuracy, the equation can be consider¬ 
ably simplified. 

The second and the fourth terms on the right-hand side of Eq. (33) are not 
only very small but also have opposite signs. Their difference is still smaller 
when compared to the rest of the terms. Thus, for a wide range of experi¬ 
mental conditions these two terms can be neglected. The simplified equation 
can be written as 

r fb = (Pm(V F 2 - Vfb) - (JIQHYf - Vfb) (34) 

Equation (34) is of second degree in V F and can be solved directly. Thus, 

B + (B 2 — 4AC) 112 
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Fig. 12. Influence of surface tension on bubble volume for inviscid liquids. 


where 


A = P/4Q 
B = J/Q 

and 

C = (JIQ)V fb ~(PI4Q)V 2 fb -r fb 

Equations (33) and (35) have been compared, and the values obtained by 
the two equations for arbitrary flow rates above 5 cm 3 /sec are presented in 
Table III. This table shows that the error introduced in the evaluation of 
bubble volumes by the use of the simplified Eq. (35) is of the order of 15%. 
Thus, Eq. (35) can be employed only when approximate values are required. 

B. Bubble Formation in Highly Viscous Liquids 

1. Davidson and Schuler's Model (D8) 

This model makes use of the same concepts as for inviscid liquids except 
that the viscous resistance is now taken into consideration. Further, the 
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TABLE III 

Values of Bubble Volume Calculated by Rigorous [Eq. (33)] and 
Simplified [Eq. (35)] Equations for Inviscid Liquids with 
Surface-Tension Effect 0 


Bubble volume calculated 
from the 


No. 

Volumetric flow rate 
(cm 3 /sec) 

(i) 

Simplified 

equation 

(ii) 

Rigorous 

equation 

Ratio = (i)/(ii) 

1 

5 

0.23 

0.26 

0.88 

2 

10 

0.39 

0.39 

1.00 

3 

20 

0.78 

0.71 

1.09 

4 

100 

4.38 

3.65 

1.20 


a y = 71.7 dyn/cm; D = 0.25 cm. 


inertia of the liquid can be neglected at low flow rates, whereas it cannot 
be neglected for high flow rates. Thus, two separate equations have to be used 
for the above two cases in order to determine the bubble size. 

a. Small Flow Rates. In this case, the bubble is assumed to be moving at 
any instant with the Stokes velocity appropriate to its size. The bubble is 
assumed to be spherical and unaffected by the presence of other bubbles. 
Further, the gas momentum and the liquid circulation are assumed to be 
negligible. 

The velocity of the bubble at any instant can be written as 

v = ds/dt = 2gr 2 p l /9p (36) 

Based on Eq. (36) and the condition of detachment, the final equation is 
V F = (An!^ l \\5 l iQI2p l g)^ (37) 

This equation has been tested by Davidson and Schuler (D8) for an orifice 
radius of 0.0334 cm, liquid viscosity ranging from 515 to 1040 cp, and a flow 
range of about 0 to 2.5 cm 3 /sec. 

It may, however, be pointed out that the lowest flow rate obtained by 
them is about 0.25 cm 3 /sec. Below this value, Krishnamurthi (K6) has found 
that the surface tension is very important and should be taken into account if 
bubble volumes are to be predicted accurately by theory. 

b. Large Flow Rates. The momentum of the surrounding liquid is also 
considered here. The equation for bubble movement then becomes 

Mg — {\\j\6)[d{Mv)jdt] + 6nrpv 


(38) 
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We know that M — p t Qt 


Hence, 


QSI = ^Q< 


dv 

li 


11 „ . n 

+ 72 v Q + 6n ~ 

16 Pi 



(39) 


The above equation is applicable when the bubble detaches at s = r F . But 
if the orifice diameter is large, the authors recommend the use of the relation 
s = (r F + R) to obtain a better value of the time of bubble formation. Then 
Eq. (39) is modified to 

(Qt+ v 0 )g = (Qt + F 0 ) ^ Qv + 6n ~ ^ j (Qt + F 0 ) 1/3 u (40) 
Rearranging, we obtain 


where 

and 



V=(Qt+V,o) 


A 


96n /A/_3_\ 1/3 

TfU/te/ 


(41) 


On integrating Eq. (41) between the limits V = F 0 and V = V, and by 
assuming that at F = F 0 , v = 0, we obtain 


ds 

v = Q — = 
dV 


48^ 

-J/2/3 

5F 1/3 

20 

ne 

. e 

C 2 

C 3 


where 


60F _1/3 120F _Z/3 

120F -1 ! L 

C 5 J ~ Fexp(CF 1/3 ) 


(42) 


L = 


4 Sg 
116 


exp(CF* /3 ) 


~ E 3/3 
. C 


5Fq /3 20F o 
C 2 c 3 


60Fq / 3 120Fi /3 

c 4 "* c 5 


120' 

c 6 . 


and 

C=3A/Q 


Equation (42) is again integrated from V = V 0 where s = 0 to V = V, resulting 


in 


s = ■ 


48 q [3F 5/3 15K 4/3 20K 90F 2/3 3607 1/3 minVV 


11 Q 2 L 5 C 


“t" ' a 

5C C 3 


C 4 


+ 


/■ 


F 0 


-LLy +3 g <- c 7>T 

eL »=i «•«! J 


KO 


( 43 ) 
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A graph of s versus t can be plotted by making use of Eq. (43). The time 
required to reach a volume such that s = (r F + R) is thus determined. The 
final bubble volume is obtained by multiplying the calculated time with the 
volumetric flow rate of the gas. 

The authors have verified Eq. (43) for an orifice diameter of 0.192 cm. The 
results, given in Table IV, show good agreement between the theoretical and 
the experimental values. The data for the range of 2,5 cm 3 /sec to 14 cm 3 /sec 
are not reported by the authors and it is not known which of the equations is 
applicable. 


2. Kumar and Kuloor's Model (AT9) 


The model for inviscid liquids is equally well applicable to viscous liquids 
also, provided that the resistance due to viscous drag is included in the analy¬ 
sis. As a first approximation, the viscous drag may be evaluated by a Stokes 
resistance term, since the bubble is not followed by a wake. Thus we proceed 
as before, first evaluating the force-balance bubble volume Vfb and then the 
total bubble volume by reference to the detachment stage. 

The extra viscous resistance due to expansion to be included in the analysis 
is 


= _ I/-1/3 

e 2(3/4 tt) 1/3 


(44) 


The equation for v fb then becomes 


V fi = 


ii q 2 u * 2/3 

1927I0(3/4ti) 2/3 


+ ^( 3 / 4 *)" 1/3 
20 



(45) 


It is interesting to note that Eq. (45) reduces to the inviscid case when the 
viscosity of the liquid p is equal to zero. On rearranging and calculating various 
constants in Eq. (45), we obtain 

V fb = 4.85 x 10 ~ 5 Q 2 VJ b 213 + 2.47 x 10 -^IpdQVfV' 3 (46) 
The equation of motion for the bubble center can now be written as 


(d/dt)(Mv) = ( V fb + Qt)(Ap)g - 6nrpv' (47) 

where v\ the velocity of the bubble center, is the velocity of the center due to 
expansion, dr/dt , plus the velocity of the base, v. Substituting this value of v 
in Eq. (47) and simplifying, we obtain 




— 6nrixv e — 6 nr/xv 


( 48 ) 
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TABLE IV 

Comparison of Theoretical and Experimental Values of 
Davidson and Schuler (D8) for Viscous Liquids at 
Large Flow Rates 0 


No. 

Flow rate 
(cm 3 /sec) 

Kinematic 

viscosity 

(stokes) 

Bubble volume 
(cm 3 ) 

Theory 

Experiment 

Theory 

Experiment 

1 

18 

7.9 

2.1 

1.9 

0.90 

2 

24 

7.9 

2.6 

2.5 

0.96 

3 

33 

7.9 

3.2 

3.3 

1.03 

4 

15 

6.5 

1.6 

1.6 

1.00 

5 

26 

6.5 

2.4 

2.4 

1.00 

6 

35 

6.5 

3.0 

3.1 

1.03 

7 

14 

5.4 

1.3 

1.3 

1.00 

8 

20 

5.4 

1.8 

1.7 

0.95 

9 

26 

5.4 

2.2 

2.2 

1.00 


a Orifice diameter = 0.192 cm. 


It is to be noted that on expressing the motion of the center in terms of the 
motion of the base of the bubble, the various terms of Eq (47) have been split 
into two parts, viz. (i) that associated with the movement of the base due to 
the free motion of the bubble (this would be the only term if the bubble does 
not change its size during its movement), and (ii) that associated with the move¬ 
ment of the bubble center due to expansion. The inertial term has split into 
left-hand-side terms and the second term on the right-hand side, whereas the 
viscous drag term has split into the third and fourth terms on the right-hand 
side of Eq. (48). 

After the evaluation of various terms of Eq. (48), it can be written as 


/ 11 \dv / 11 \ 

( v „+Qi)[p,+ Ti p.) 1 -, + {p.+- 6 f')e’ 

- d'rt + 20to - PM - 2(3^1 » (Kn + 20 

(, '‘ + S0 


- 1/3 


- 671 ( 3 / 4 / zy i3 n(V fb + Qt) 1/3 V 


( 49 ) 
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Although the above equation can be solved as such, the solution is very 
complicated and needs the help of a computer to be practically applicable. 
Hence, it has been slightly simplified. Though the buoyancy force is directly 
proportional to the volume of the bubble, the resisting force is proportional 
only to its cube root. Thus, r can be assumed to be a nonvariable. This is 
considered to be equal to 1.25 r fb rather than r fb since 1.25 r fb gives approxi¬ 
mately an average value of r during the expansion stage. Equation (49) now 
becomes 


dv/dT + A(v/T) = B - CT~ 4/3 - DT~ S/3 (50) 

where 

(1.25)67t(3/47t) 1/3 ^r}£ 3 
QlPg + (H/ 16 )PJ 

... , 96(1.25)*^ 

^ 1 - - 

ii Qpi 

B = (Pl ~ Pg)9 
Q[p g + (H/16)P(] 

16 g 

_ 

- 

lie 


2(3/47t) l/3 [p 9 + (ll/16)p,] 

24 p 
~ 11(3/4tt) 1/3 p, 


127r(3/47r) 2/3 


and T= (V fb + Qt) 

The solution of Eq. (50) with the boundary condition, T = V fb or v = 0 at 
t = 0, is 

_ 1 r ^ (rpA + 1 t/A + 1 \ ^ (T A ~ */3 \/A — 1/3\ 

T A 1.7+7 (T Vfb } JTT73 (t } 


_^_( r .- 2 /3 _ F .- 2/ 3) 


(51) 
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v can be expressed as dx/dt and the resulting equation is solved again for jc 
varying from 0 to r fb (condition of detachment) when T changes from V fb to 
V F . The final solution is 


r /* = 


B 


Q(A + 1 ) 

c 


Q(A - 1/3) 
1 


-A + 1 


[|o 

- rjft] 


(Vf A+ 1 Vfi 1/3 


A + 1 


yA+t 

V fb 



- f (V; A+l V f A b ~ 213 - K}i 3 )] 


(52) 


As A is quite large and D is quite small in most of the situations of practical 
interest, Eq. (52) can be simplified as 


B 


r fb ~ 


2 Q(A + 1) 


(Vi - Vfb) ■ 


3 C 


2204 - 1/3) 


(1 VV 3 - V% 3 ) (53) 


The authors have verified Eq. (53) by using their data and find the agree¬ 
ment to be good. 

The data of Davidson and Schuler (D8) for higher flow rates have also 
been tested. Their experimental values are given in Table IV along with those 
calculated by this model. The authors collected data at flow rates higher than 
those of Davidson and Schuler (D8) and plotted them as shown in Fig. 13, 
where the solid line represents the values calculated through Eq. (53). 


C. Model of Ramakrishnan, Kumar, and Kuloor (Rl) 


This model is an extension of the concepts developed by Kumar and 
Kuloor (K16, K19). The first stage equation for this situation is 


vV h 3 = - 


UQ 2 


lnQVjl 3 TiPy cos evjl 


fb 192n(3/4n) 2l3 g 2(3/4n ) 1/3 gp t 

Evaluating various constants, we obtain 

Vf'b = (4.85)(10 _5 )2 2 +(2.47)(10- 3 )^j21 / }6 3 


SPt 


nDy cos 9Vy b 3 


9 Pi 


(54) 


(55) 
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Volumetric flow rate (crrrVsec) 

Fig. 13. Comparison of the model (K19) with the experimental data for bubble forma¬ 
tion in viscous liquids without surface-tension effect. 

Equation (54) is quite general in nature, and can be used for the evaluation 
of V fb in any situation when the bubble formation is taking place under con¬ 
stant flotv conditions. Thus, if the last term on the right-hand side of Eq. (54) 
is omitted, the equation reduces to that of the viscous case without surface- 
tension effects. Similarly, if the second term on the right-hand side is dropped, 
the resulting equation is the one for the inviscid case with surface tension. 
Further, if both the second and the third terms on the right-hand side are 
deleted, the equation reduces to that of the inviscid case without surface 
tension. 

The equation of motion for the bubble center now differs from that for the 
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viscous case only so far as the inclusion of the surface tension force. Thus, 


dv dM 

M Jt+ v-^(V fb + Qt)A Pg 


e 2 [ Pg + (11/16 )p^y £t +eo ~ 2/3 

127i(3/47t) 2/3 


— 6nri.iv e — 6nr P v — nDy cos 6 (56) 
As indicated before, r for the purpose of viscous resistance can be considered 
to be constant and equal to 1 25r fb . 

dv/dT + A(v/T ) = B - CT ~ 4/3 - ET~ X - DT~ 5/3 (57) 

where 

, (l- 25 Xfa)(3/ 4 ^) 1/3 U^ 

Qlpg + (H/ 16 )P/] 

„ (1.25)(967rr /6 p) 


(Pi - P g )g ~ 16 g 

e[p, + (ii/i6)p,]~ne 


2(3/47r) 1/3 [p g + (1 l/16)p ( ] 


127t(3/47i) 2/3 


nDy cos 0 _ 16nDy cos 0 

e[p, + (ii/i6) ft )* Hep* 


T=(V fb +Qt) 

The solution of Eq. (57) is formally 

A dT/TJ + K 

where R is the right-hand member of the equation and if is a constant of 
integration; whence 

” = jl [ (T A+1 - Vf b (T A - Vj b ) 




C (jA-l/3 _ yA- 1/3) _ _JL_ (T A ~ 2/3 - V A 


A — 1/3 


A-213 


fb 


2/3\ 


(58) 


that is 
dx 

Q~j^ = 


5 

_/ r _ 

V A+i \ 
V fb \ 

- E -U- 

y%\ 

A + 

il T 

T A j 


T A j 


c 

( rp~ 1/3 _ 

v A f r l3 \ 

, D ( T -v 3 ^: 2/ h 


- 1/3 

r 

T A j 

A - 2/3 V T A j 


(59) 
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Integrating for * between the limits 0 and r fb and T between the limits V fb 
and T , we get 


Qx 




3 C 


2(A - 1/3) 

r -A + 1 


(K|/3 _ F 2/ 3) _ 


3D 


( F l/3 _ K}' 3 ) 


_ _ V \' 

l-A + 1 -A + l/ll 

r 

l/A- 1/3 _ 


“ 2/3) 


+ 1 ) 




D 


A - 1/3 


yA-2/3 

A - 2/3 /fc 


(60) 


A large number of calculations show that the last two terms of Eq. (60) 
are extremely small and can be neglected. Then the equation simplifies to 


B 


r fb = 


2 Q(A + 1) 
3 C 


(Vi-V} b )- — (V F -V,j 


2Q(A - 1/3) 


( V\ 13 


AQ 

V 2 fb) 


(61) 


Knowing the force balance bubble volume and the other system 
charctaeristics, the final bubble volume V F can be directly calculated from 
Eq. (61). 

This general equation presents a number of noteworthy features. If the 
surface tension term E is dropped, then the equation corresponds exactly to 
the equation for highly viscous liquids without surface-tension effects. 

Similarly, all the cases discussed earlier become the special cases of this 
general equation, when appropriate terms are omitted. As this equation con¬ 
tains all the variables of importance, the verification of the data of the earlier 
investigators and explanations of the contradictory observations reported in 
literature should be possible by a judicious discussion of it. 


1. Effect of Surface Tension 

Considering the case of vanishingly small flows (Q « 0), both the first and 
the second terms on the right-hand side of Eq. (55) which contain Q , become 
zero, and the force balance bubble volume is obtained directly by equating the 
surface tension force with the buoyancy force. Even in the detachment stage, 
Qt c is zero because Q & 0. Thus, for zero flow rate, the equation reduces to 
the well-known Tate’s law, which is used for finding the surface tension of 
liquids by the pendent drop or the forming bubble method. 
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The same situation exists when the viscosity is large but Q is extremely 
small. The second term on the right-hand side of Eq. (55) is still small when 
compared to the third term and the bubble volumes are decided by the surface 
tension. As the flow is increased, the various terms which contain Q go on 
increasing but the surface tension term remains constant. Thus, the surface 
tension effect goes on decreasing with increasing flow rate both for highly 
viscous and inviscid liquids. For highly viscous liquids, however, the relative 
magnitude of the surface tension force in the overall resisting force becomes 
less. Thus, from the general model, the following conclusions can be drawn 
regarding the influence of surface tension on the bubble volume: 

a. At vanishingly small flow rates, the bubble volume is decided entirely 
by the surface tension and buoyancy forces. 

b . For liquids of low viscosity, the surface tension is important at low 
flow rates, but assumes a continuously lesser importance as the flow rate is 
increased. Finally, a stage is reached when its contribution to the bubble vol¬ 
ume becomes negligible. 

c. For highly viscous liquids, as the flow rate is increased, the surface 
tension loses its importance at much smaller flow rates than those for inviscid 
liquids. 

In order to verify the importance of surface tension, a large amount of 
data has been collected for bubble formation in liquids having different sur¬ 
face tension but nearly constant viscosity. Liquids of both low and high 
viscosity have been used. The equation is seen to agree excellently with the 
data obtained over a wide range of flow rates. It was also observed that for 
highly viscous liquids the surface-tension effects become negligible at much 
smaller flow rates. 

The existence of a considerable discrepancy'in the results reported by 
various investigators, has already been mentioned. Datta et aL (D4), Coppock 
and Meiklejohn (C8), and Benzing and Myers (B3) find increase in surface 
tension to increase bubble volume (Table I). All these investigators have used 
liquids of low viscosities and extremely small flow rates (about 0.008 cm 3 /sec). 
Under these conditions the influence of drag terms during both the stages is 
very small (<5%), and hence the influence of surface tension shows up 
strongly. Table V compares the experimental data of several investigators 
with the theoretical values calculated from the general model. The model is 
seen to predict the actual trends well. 

Quigley, Johnson, and Harris (Ql) find that the surface tension has no 
effect on the bubble size. A comparison of their data with the present general 
model is not possible because they conducted their experiments under con¬ 
stant pressure, not constant flow, conditions. However, the flow rates em¬ 
ployed by them are high; in such cases, even for constant flow conditions, 
the influence of surface tension is small. 
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TABLE V 

Comparison of Experimental Data of Various Investigators with the 
Values Calculated from General Model Showing the 
Influence of Surface Tension 0 


Surface 

tension 

(dyn/cm) 

Density 

Pi 

(gm/cm 3 ) 

D 

(cm) 

Q 

(cm 3 /sec) 

Bubble volume (cm 3 ) 

Experimental 

Calculated 



1. Datta et al. (D4) 


73.10 

0.9997 

0.036 

0.0122 

0.0292 

0.0346 

64.60 

0.9954 

0.036 

0.0099 

0.0237 

0.0306 

49.96 

0.9844 

0.036 

0.0083 

0.0199 

0.0240 

32.39 

0.9518 

0.036 

0.0054 

0.0129 

0.0160 

22.88 

0.7936 

0.036 

0.0052 

0.0126 

0.0136 

73.10 

0.9997 

0.630 

0.0446 

0.1071 

0.1552 

64.60 

0.9954 

0.630 

0.0412 

0.0990 

0.1378 

49.96 

0.9844 

0.630 

0.0410 

0.0985 

0.1086 

32.39 

0.9518 

0.630 

0,0400 

0.0960 

0.0737 

22.88 

0.7936 

0.630 

0.0398 

0.0955 

0.0629 


2. Coppock and Meiklejohn (C8) 


33.4 

0.690 

0.061 

0.025 

0.0065 

0.0110 

54.0 

0.989 

0.061 

0.025 

0.0100 

0.0123 

72.0 

1.000 

0.061 

0.025 

0.0133 

0.0159 

33.4 

0.690 

0.091 

0.025 

0.0095 

0.0159 

54.0 

0.989 

0.091 

0.025 

0.0140 

0.0178 

72.0 

1.000 

0.091 

0.025 

0.0185 

0.0231 

33.4 

0.690 

0.102 

0.025 

0.0107 

0.0177 

54.0 

0.989 

0.102 

0.025 

0.0148 

0.0198 

72.0 

1.000 

0.102 

0.025 

0.0212 

0.0258 



3. Davidson and Schuler (D9) 


72.7 

1.000 

0.0668 

0.5 

0.0260 

0.0333 

72.7 

1.000 

0.0668 

1.0 

0.0365 

0.0481 

72.7 

1.000 

0.0668 

1.5 

0.0365 

0.0632 

72.7 

1.000 

0.0668 

2.0 

0.0500 

0.0789 

72.7 

1.000 

0.0688 

2.5 

0.0680 

0.0952 

27.1 

0.810 

0.0688 

0.5 

0.0090 

0.0215 

27.1 

0.810 

0.0688 

1.0 

0.0200 

0.0350 

72.7 

1.000 

0.4000 

5.0 

0.2000 

0.3106 

72.7 

1.000 

0.4000 

10.0 

0.4200 

0.5210 

72.7 

1.000 

0.4000 

20.0 

0.9000 

0.9897 

72.7 

1.000 

0.4000 

30.0 

1.3000 

1.5068 

27.1 

0.810 

0.4000 

5.0 

0.2000 

0.2306 

27.1 

0.810 

0.4000 

10.0 

0.4200 

0.4348 

27.1 

0.810 

0.4000 

20.0 

0.8500 

0.9007 

27.1 

0.810 

0.4000 

30.0 

1.1500 

0.4173 


Viscosity, /x = 0. 
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Davidson and Schuler (D8) and Siemes and Kaufmann (SI 1) also report a 
negligible influence of surface tension. These authors have used low orifice 
diameters and highly viscous liquids, both of which reduce the surface tension 
effects. For very small flow rates, however, the surface tension is effective 
even for highly viscous liquids, as found by Krishnamurthi, Kumar, and 
Kuloor (K8). 

Thus, the apparently inconsistent results of various investigators regard¬ 
ing the influence of surface tension on the bubble volume are explainable by 
the difference in the experimental conditions employed. 


2. Effect of Viscosity 

There exists the maximum amount of contradiction regarding the role of 
viscosity in bubble formation. The present model indicates that for both the 
stages of expansion and of detachment— 

(i) the effect of viscosity is large at higher flow rates, 

(ii) the effect of viscosity is large for liquids of low surface tension and 
where orifices of small diameter are used. 

Thus, if the liquid is highly viscous, then the drag force is so predominant 
that the bubble volume is highly sensitive to viscosity. Again, if the viscosity 
and the flow rate are both small, then variations in the viscosity do not affect 
the bubble volume appreciably. 

The actual data collected for liquids of various viscosities are presented in 
Fig. 14. The solid lines represent the theoretical values in the figure. 

To recapitulate the discrepancies in literature, Datta et ah (D4) varied the 
viscosity of water from 0.012 to 1.108 poise and found that with an increase in 
the viscosity, the bubble volume decreased for all the nozzles used. This is in 
apparent contradiction to the observations of most of the other investigators. 
An effort can now be made to explain this discrepancy on the basis of the 
present model. Note is to be made of the extremely small volumetric flow 
rates employed by Datta et ah (D4). In fact, they are in the range where 
effects due to viscosity are negligible when compared to the effects of surface 
tension. Thus, though there is a hundredfold increase in the viscosity, it is 
accompanied by a large variation in the surface tension, which decreases from 
72.8 to 65.7 dyn per centimeter. At the very small flow rates employed, the 
decrease in the bubble volume observed by Datta et ah (D4) seems more 
likely to be due to this decrease in the surface tension rather than to the 
hundredfold increase in the viscosity. Thus, the influence of surface tension 
has been mistakenly attributed to the effect of viscosity. The actual values of 
the bubble volumes obtained by these authors for a typical nozzle are given 
in Table VI along with those obtained by the application of the present model. 
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Fig. 14. Comparison of the general model (Rl) with the data collected for bubble 
formation in viscous liquids. 


It is interesting to note that the model predicts a decrease in the bubble volume 
in spite of the hundredfold increase in the liquid viscosity under the condi¬ 
tions of the experiments conducted by Datta et ah (D4). 

On the other hand, Coppock and Meiklejohn (C8) find that the viscosity 
has negligible influence on the bubble volume. These authors have used liquids 
of very low viscosity and extremely small flow rates where the effect of 
viscosity is negligible. Calculations made for a set of data of Datta et al (D4) 
are presented in Table VI. The predictions made by the general model are 
again seen to bear with the trends found experimentally. 

The range of variables investigated by Davidson and Schuler (D8) is once 
again such that the influence of viscosity is predominant. This has already been 
discussed in the section dealing with the effect of surface tension. 

3. Effect of Density 

Only two main opinions merit our attention in this case. They are: (i) the 
bubble volume decreases with an increase in the liquid density, and (ii) the 
density has no effect. 

Referring to Eq. (55), it is easily seen that if both and Q are small, that 
of surface tension and that of buoyancy are the only important forces, and an 



TABLE VI 

Comparison of Experimental Data of Datta et al. (D4) with the Values Calculated from General Model 

Showing the Influence of Viscosity 


Vis¬ 

cosity 

P 

(poise) 

Surface 

tension 

Y 

(dyn/cm) 

Density 

Pi 

(gm/cm 3 ) 


D — 0.036 cm 

D 

= 0.141 cm 

D — 0.388 cm 

Q 

(cm 3 /sec) 

Bubble volume 
(cm 3 ) 

Q 

(cm 3 /sec) 

Bubble volume 
(cm 3 ) 

Q 

(cm 3 /sec) 

Bubble volume 
(cm 3 ) 

Experi¬ 

mental 

Calcu¬ 

lated 

Experi¬ 

mental 

Calcu¬ 

lated 

Experi¬ 

mental 

Calcu¬ 

lated 

0.012 

72.8 

0.9994 

0.00810 

0.0072 

0.0107 

0.06083 

0.0292 

0.06114 

0.20500 

0.0984 

0.20723 

0.154 

68.3 

1.1700 

0.00787 

0.0070 

0.0077 

0.05208 

0.0250 

0.03188 

0.18120 

0.0870 

0.09653 

0.235 

67.6 

1.1850 

0.00787 

0.0070 

0.0077 

0.05104 

0.0245 

0.03121 

0.18120 

0.0870 

0.09137 

0.497 

66.4 

1.2100 

0.00765 

0.0068 

0.0075 

0.04812 

0.0231 

0.03114 

0.17170 

0.0850 

0.08762 

1.108 

65.7 

1.2200 

0.00765 

0.0068 

0.0074 

0.04583 

0.0220 

0.03006 

0.17500 

0.0840 

0.08645 
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increase in the density brings about a decrease in the bubble volume. Similar 
reasoning is applicable to the detachment stage also. If, on the other hand, Q 
is very large and fi is negligible, then the second and the third terms on the 
right-hand side of Eq. (55) are small when compared with the first term, and so 
the bubble volume during expansion is independent of the liquid density. The 
same is true for the detachment stage also. Thus, the liquid density is non- 
effective for liquids of low viscosity and for high flow rates. This situation 
exists in the experiments carried out by Quigley et al (Ql). A quantitative 
comparison is, however, not possible because their experiments have been 
conducted under constant pressure conditions. 

On the other hand, if the first and third terms on the right-hand side of 
Eq. (55) are small (which is true for highly viscous liquids), the density still 
appears in the equation and its increase results in a decrease in the bubble 
volume. This situation is applicable to the experiments of Davidson and 
Schuler (D8). Since no data are reported which pertain to the effect of density 
alone, a comparison is not possible. 


V. Bubble Formation under Constant Pressure Conditions 

The constant pressure condition arises when the chamber volume tends to 
infinity (in practice, more than about a liter), and the pressure in the gas 
chamber remains constant. As the pressure in the bubble varies with the 
extent of its formation, the pressure difference across the forming device also 
varies, thereby bringing about a condition of changing flow rates. 

A. Correlation of Hayes, Hardy, and Holland (H4) 

These authors have assumed the bubble to be expanding at the orifice, and 
have used the force balance equation at the time of detachment. The various 
forces considered by these authors are buoyancy, force due to the addition of 
mass (P2), excess pressure force, surface tension force, drag force, and force 
due to the inertia of the liquid. 

Considering all the above forces, and applying Newton’s second law of 
motion, we obtain 

dm x / nD 2 \ dv d(mv) 

v o-Jf+ vmg + J(Pi — p) — F + nDy — m — = (62) 

To obtain bubble volume by Eq. (62), the various terms are evaluated at a 
time just prior to the release of the bubble. Thus, 


(Pi ~P) = 4y/d 


(63) 
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Further, 


d(mv) dm 
dt 0 dt 


~^Q 2 P 9 

nD 2 




(64) 


The drag of the bubble is expressed in terms of the drag coefficient as 

F= C D p l v 2 A f /2 (65) 

Assuming the bubbles to be spherical, the frontal area A f of the bubble 
becomes nd 2 j4. Further, by making use of the expression v — Qjnd 2 , the 
drag force can be written in terms of Froude number, the drag coefficient, and 
the volume of the bubble. Thus 


F = (C D p,/2)(Q 2 jji 2 d 4 )(nd 2 /4) (66) 

= 3g Pl C D N Fr V/4 

where 

N Fr = v 2 jdg = Q 2 /n 2 d 5 g 


In Eq. (66), in addition to F, C D is also unknown; it has been empirically 
found by the authors from the experimental data on bubble formation. 

The force due to the movement of the liquid surrounding the bubble is 
m\dvjdt). For a sphere moving in an infinite medium of an inviscid fluid, the 
mass of the liquid m' is equal to half the mass of the displaced liquid. The 
authors, however, assumed merely a direct proportionality between m f and 
the mass of the displaced fluid, instead of the above relationship, because they 
considered their flow not to be irrotational. 

The force can be expressed as, 

m{dvjdt) = t Vp i(dv)dd^{ddI dt) (67) 

where r is the proportionality factor, found empirically. 

On evaluating various terms of Eq. (67) and substituting, we obtain 

fri(dv/dt) = t p t V(—2Qlnd 3 )(2Qlnd 2 ) (68) 

= —4p l gzN jFr V 

Now all the terms of the basic Eq. (62) have been evaluated; hence it can 
be written as follows: 

At the time of detachment, when V = V F , 

V f (Ap)g( \ -ip) = nDy(l - D/d) - <j> (69) 

where 

$ = (3p,/4 A p)N Fr (C D - 16t/3) 

<P = m 2 9jnD 2 )\\ - (1/12)(Z)A/) 2 ] 


and 



306 


R. KUMAR AND N. R. KULOOR 


Equation (69) contains unknown quantities like i// 9 which have to be 
determined either theoretically or empirically. The term (1 — \j/) is a function 
of both the Reynolds number and Froude group, and presumably can be 
correlated as a power relationship of the form 

|(1 - *)| = K(N Re ) a (N Fr ) b (Dld) c (70) 

Further, Eq. (69) gives rise to two types of situations depending on whether 
nDy{\ — D/d) is greater or smaller than </>. Since d is larger than D, the term 
(1/12 )(D/d) 2 can be neglected in </>, and so </> is simply equal to (4Q 2 p g /nD 2 ), 
which is the time rate of change of momentum of the gas added to the bubble 
just prior to its release. 

The authors find that for nDy(\ — D/d) > </>, the bubble volume remains 
essentially constant, whereas for nDy{\ - D/d) < <j >, the frequency remains 
essentially constant, and the extra flow goes to increase the bubble volume. 
Thus, two relationships are given, corresponding to each of the above two 
conditions. The following correlations are obtained by the solution of Eqs. 
(69) and (70): 


For nDy (1 - D/d) > </>: 


Since for this case the bubble volume remains essentially constant but the 
frequency changes with the flow rate, the authors have considered frequency 
as the dependent variable in the following correlation: 


7 SA v 10 -3 T DN o 1 1 _0 * 697 

f — iU / y N-°' 5 \-° A12 (NONl ^°' 525 -_ — _ 

; JVL (j Fr ' Re } { Q Re) L(6Q/7r) 1/3 J 


where 


n 1/3 

N Fr = N Fr f-^ = -J 


N Re = N Re f~ ll3 = 


gn 1,3 6 5/3 

Q 2l3 p l 

n 2l3 6 l ' 3 n, 


min) 1/3 \ 

Modified Froude group 
Modified Reynolds number 


(71) 


and 




mg 


lnDy(l - D/d) - 0 . 


For nDy(l — D/d) < (j>: 

Here, the bubble volume has been assumed to be the dependent variable 
and the correlation is: 

V P = (23AIN)(N Fr N 5/3 ) 0A63 (N' Re N 1,3 )~°' 016 \DN 1 /3 ) 0-712 (72) 


Nfr = N Fr d 5 


where 
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and 

N' Re = N Re d 

The relationships (71) and (72) were tested by their proposers, who re¬ 
ported that the average percentage deviations of the calculated values of / 
and V F from the experimental values are 16 and 10, respectively. 

B. Davidson and Schuler’s Model (D8, D9) 

The analysis used by these authors for constant flow conditions has been 
extended to constant pressure conditions also. The major change arises from 
the fact that now the flow is a function of the extent to which the bubble has 
already been formed. This has been introduced into the constant flow equa¬ 
tion by means of an orifice equation. 


1. Bubble Formation in Inviscid Fluids 


The basic equation here is the same as that for constant flow conditions, 
involving the instantaneous movement of the bubble. Thus: 




(73) 


The flow rate Q , at which the bubble volume is changing, is evaluated by 
the equation 


Q = dV/dt = K(F + p l9 s - 2y/r) l/2 (74) 

where 


P = (A - Pidh) 


In Eq. (74), K , an experimental constant depending on the orifice, is 
determined by experiments with a steady state flow of gas through the orifice 
in the absence of the liquid. 

Equations (73) and (74) taken together were solved (D8, D9) with the aid 
of a computer through numerical methods. The initial condition used was that 
at t = 0: 

s = 0, ds/dt ~ 0, and V 0 = 4nR 3 /3 

The final bubble volume V F is evaluated by assuming the detachment to 
take place when s = r F + R. 

The influence of surface tension of the system and of the orifice radius 
were found to be small. Surface tension, however, has an effect on the mini¬ 
mum value of P for bubbling to occur, since bubbling ceases if P is less than 
2 y/R. 
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The equations were verified mainly with water as the experimental liquid. 
A part of the results, as given by the authors in tabular form, is reproduced in 
Table VII. From Table VII it is evident that the agreement between the 
theory and experiment is not very good. 


2. Bubble Formation in Highly Viscous Liquids 


Two separate sets of equations have been derived by Davidson and 
Schuler, one for small and one for large flow rates. Both cases are discussed 
below. 

a. Small Flow Rates. The equation of the motion of the bubble remains the 
same as for constant flow conditions and corresponds to Stokes’ equation, i.e., 

v = 2gr 2 p,/9p l (75) 

The final equation, by taking the variation in flow rate into account, becomes: 


KP l/2 t = (47i/3)(2y/P) : 


tan 0 (sec 5 0 + 7 sec 3 0 + 


T s “ 9 ) 

+ J ,,n (i + i)] <76) 


where 


6 = cos" 


4 71 \: 

Tv)' 


4ti\ 2v 
P 


1/2 


V and r can be evaluated for any value of t , from Eq. (76). But the equation 
cannot directly give the final bubble volume V F , for evaluating which it is 
necessary to plot v versus t , by making use of Eqs. (75) and (76). The distance 
^ travelled at any time t can be found from the plot by graphical integration. 
From a plot of s versus r + r 0 , the value of r + r 0 corresponding to any 5 is 
obtained, from which, in turn, r F and V F are calculated. 

The above equations were verified by using aqueous glycerol solutions 
(D8). The model gives results which agree reasonably well with the experi¬ 
mental data. 


b. High Flow Rates. Here the inertia of the surrounding liquid is taken 
into consideration in the movement of the bubble. 

The equation of motion now assumes the form 


Vg — 6n — (37/47c) 1/3 
Pi 


ds 11 v— --- — 
dt + 16 dt 2 + 16 dt dt 


(77) 


The equation of flow through the orifice is 

dV/dt = K(P - 2 y/r - p t gs ) 1/2 


(78) 



TABLE VII 


Data on Formation of Air Bubbles in Water under Constant Pressure Conditions 


S.No. 

D 

(cm) 

K 

(ml • cmVgm*) 

P 

(gm/cm • sec 2 ) 

Mean flow rate Q (cm 3 /sec) 

Bubble volume V (cm 3 ) 

Expt. 

(D9) 

Theory 

Expt. 

(D9) 

Theory 

Eq 

(73), (74) 
(D9) 

Eq 

(89) 

(S3) 

Eq 

(73X74) 

(D9) 

Eq 

(89) 

(S3) 

1 

0.298 

1.90 

951 

32 

67 

65.5 

2.3 

3.5 

3.29 

2 

0.298 

1.90 

1118 

45 

70 

68.0 

2.9 

3.8 

3.52 

3 

0.298 

1.90 

1323 

61 

76 

73.2 

3.4 

4.2 

3.78 

4 

0.374 

3.06 

779 

33 

102 

86.4 

3.4 

6.1 

5.89 

5 

0.374 

3.06 

877 

47 

105 

89.7 

4.1 

6.4 

6.13 

6 

0.374 

3.06 

1024 

60 

112 

93.9 

4.5 

6.9 

6.47 

7 

0.412 

3.82 

734 

30 

124 

109.0 

4.3 

7.8 

7.88 

8 

0.412 

3.82 

832 

57 

129 

113.0 

4.9 

8.3 

8.18 

9 

0.412 

3.82 

1006 

68 

141 

118.2 

5.7 

9.1 

7.98 

10 

0.460 

4.90 

632 

25 

156 

135.0 

5.6 

10.7 

10.73 

11 

0.460 

4.90 

739 

60 

163 

140.0 

6.9 

11.4 

11.18 

12 

0.460 

4.90 

790 

68 

169 

142.8 

7.1 

11.7 

11.39 


o 

VO 
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Equations (77) and (78) cannot be solved analytically. A computer was 
used for obtaining the values of s, F, and dsjdt for regularly increasing values 
of t : The initial values taken were 

at / = 0, s = 0, V=(4nR 3 /3), and (ds/dt) = 0 

The volume V F of the bubble moving away from the orifice was taken to be 
the difference of the total volume at s = r F + R, and the initial volume 
F 0 = 4nr 3 j3. The volume F 0 formed the nucleus of the succeeding bubble. The 
model gives good agreement up to a flow range of 60 cm 3 /sec. 

C. Model of Satyanarayan, Kumar, and Kuloor (S3) 

These authors have extended the concepts developed by Kumar and 
Kuloor (K16, K18, K19) for bubble formation under constant flow condi¬ 
tions to the situations of constant pressure conditions. 


1. Bubble Formation in Inviscid Fluids 

As before, the force balance bubble volume is first found, and its volume is 
then used to determine the final bubble volume. 

The procedure is the same as for constant flow conditions. The only forces 
to be considered are the inertia force, the surface-tension force, and the 
buoyancy force. The viscous effects are neglected. Making a force balance, we 
obtain: 


VfbiPl - Pg)9 = 


d(M Ve) 
dt e 


+ 2nRy cos 9 


(79) 


The term d(Mv e )/dt e must be determined before the above equation can be 
used for evaluating V fb . Thus 


Further, 


d(Mv e ) 

dt e 


dv e dM 

= M — -b v e —— 

dt„ dt P 


v e = dr/dt e = Q/4nr 2 


(80) 

(81) 


Q is not a constant in this case; it depends on the stage of bubble formation. 
It can be determined from orifice equation, as 


where 


Q = K(P + Pl gr-2ylry 12 
P — {Pi~ Pi 9 h) 


(82) 
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Substituting Eq. (82) in Eq. (81), we obtain 

K(P- Pl gr-2ylryi 2 

4nr 2 


(83) 


Thus, 

dVe_K 

dt „ 8 n 


< JT = r-( Pr * + Pi9 r 3 ~2yr 5 ) 1/z (-4 Pr s -3p,gr 4 


dr 


+ 10yr -6 ) — (84) 
at„ 


But ( dr/dt e ) = v e , and v e is known from Eq. (83). Thus, 
dv. -K 2 


dt. 32 n 2 r 6 


(4 Pr + 3pigr 2 - 10y) 


Further, 


Therefore, 


dM 
dt, 


= K(p g + ^ 6 p^(P + p ig r-2ylry' 2 
dM _ K 2 [p g + (l_l/16)pj] ( p + Pi g r _ 2y/r) 


(85) 


( 86 ) 


(87) 


e dt e ~ 4 nr 2 

The various terms of Eq. (79) have now been evaluated so it can be written 


as 


2nRy cos 6 + 


g 2 [p g + (ll/16)p,] 
4nrj b 


(P +Pi gr fb -2yjr fb ) 


(V fb [p g + (nil6)p,-]K 


32tt 


r fb 


■j(4Pr fb + 3 Pl grj b - lOy) 

= V fb (p t -p g )g (88) 


Expressing r fb as (3/47i) 1/3 F /6 1/3 in Eq. (88), we have 

C,V- fb 2l \P + 0.6203p^K}i 3 - 3.222VJ b ,3 y) 

- C 2 V} b \2.4%2V\ l2 P - 1.149 Pl gV 2 y b 3 - lOy) 
+ 2nRy cos 0 ~ V fb (p, - p g )g 


(89) 


where 


„ _K 2 [p„ + (ll/16)p ( ] 
— 


4n(3l4n) 2 ' 3 ~ {)A422K 
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and 


c 2 


K\ P g + (H/l6) Pi ] 
327T 2 (3/4tc) 2 


0.03822K 2 /?, 


Equation (89) represents an implicit relationship between the force balance 
bubble volume and the other variables. 

To evaluate V F , it is necessary to analyze the second stage of the bubble 
formation where the expanding bubble is moving away from the orifice base. 
The equation of motion for this case differs from that for constant flow con¬ 
ditions in that Q here varies with time. The orifice equation, however, shows 
that Q in this stage varies very little. It is evident that at higher values of r, 
the variation of the bubble volume at a definite rate causes increasingly small 
variation in r. As the change in the flow rate is due mainly to the variation 
in r for a particular set of conditions, it can be considered to be negligible when 
the change in r is small. Since this condition holds good in the second stage, 
Q can be assumed to be constant, its value being Q fb , which is the flow rate 
at the end of the force balance bubble stage. 

The equations for the constant flow rate have already been derived in 
Section IV for various situations. For inviscid fluids with surface tension, the 
equation is reproduced below. 


= ^q( V f 2 - V f») - ^ " V ») 

^ (jV :b + 3 MV 1 /, 3 - K^(ln V F - In V /b ) (90) 

where 

p , ( Pi ~ Pg}9 

Q[p g + {nii6)pi] 

N = _ Q _ 

127t(3/47t) 2/3 
nDy cos 9 

ana 3 — . _ 

QlPg + 

r fb is evaluated by making use of Eq. (89). Using this value of r fb , and 
the corresponding Q fb in Eq. (90), the value of V F can be determined. 

From Eq. (89), the surface tension is seen to influence the bubble volume 
in two ways, viz. by varying the flow rate in the expanding bubble, and by 
causing a downward force at the periphery of the orifice. Even if the effect of 
the surface tension becomes negligible at the orifice tip due to either the small 
size of the orifice or the nonwettable character of the system, it still influences 
the bubble volume because of the variations in flow during the bubble 
formation. 
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The Eqs. (89) and (90) get simplified, when the surface tension effect at 
the orifice tip is small, to 

C,V- fb 2 '\P + 0.6203 p g gvy b 3 - 3.22V}„ t/3 y) 

- C 2 K;,‘(2.483 V}l 3 P - U49 Pl gV}i 3 - lOy) 

= VfbiPl - Pg)9 (91) 

and 

r /b = ^ (V F 2 - Vj b ) - ^ (vy 3 - vy 3 ) 

+ ^ (lNVy b 3 - J Vj^ln V F - In V fb ) (92) 

Equations (89)-(92) have been verified by the authors, both for the final 
overall values and for the influence of individual variables. Their typical 
results for inviscid fluids are presented in Fig. 15, which shows a good agree¬ 
ment between the theoretical and the experimental values. The authors have 
obtained higher bubble volumes than those of Davidson and Schuler (D9) 
under otherwise similar conditions. This is probably due to absence of liquid 
circulation in the present condition. 



P = (f>-ifgh) 

Fig. 15. Comparison of the model (S3) with the data collected for bubble formation in 
inviscid liquids under constant pressure conditions. 
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2. Satyanarayan , Kumar , and Kuloor Model for General Situation (S3) 


The above discussion dealt with only that particular situation where the 
continuous phase approximated to an inviscid fluid. However, the equations 
thus derived can be easily modified to include the effects of viscosity of the 
of the continuous phase. Under constant pressure conditions also, viscosity 
of the continuous phase tends to increase the bubble volume by increasing 
the drag during both the expansion and detachment stages. 

All the forces have already been considered except the viscous drag which 
is given by 

6nru , _ 

Viscous drag = -—^ K(P + p,gr - 2y/r) 1/2 (93) 

4nr 


At the end of the expansion stage, when the bubble volume is and its 
radius r fbi the force balance becomes: 


Vfbipl - Pg)9 


K 2 [p g + (U!l6)pJVJ h 2 < 3 
4rt(3l4n) 213 


P + Pl g(3l4ny l3 V;i 3 


27VJl1 1 VfjK 2 [p t + (11/16 )p^VJl 

(3/4tt) 1/3 J 327i 2 (3/4ti) 2 

[4P(3/47i) ,/3 F}£ 3 - 3 Pl g(3l4n) ll3 V% 3 - lOy] 


3 KpVjiy_ 
2(3/4tt) 1/3 


P + p l9 (3l4rt) il3 vy b 3 


2y l 1 / 2 

{3i4Kyi 3 vy 3 \ 


+ 2nRy cos 6 


(94) 


The volumetric flow rate Q remains essentially constant, since the changes 
in the bubble radius are quite small in this stage. The value of this Q can be 
taken as Q fb 5 which is the flow rate at the end of the expansion stage. The 
analysis for this stage, therefore, is the same as that for constant flow con¬ 
ditions. Thus: 


where 


r/b ~ 2Q(A + 1 ) (Vp2 ~ V}b) V/b) 


3 C 


2Q(A - 1/3) 


(V? 13 ~ V}' 3 ) ■ 


3D 


Q(A - 2/3) 


W 3 - V}1 3 ) 


A 


1 + 1.25 


96nr fb p 

U Pl Q 


(95) 


B«i6gf/ne 
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2(3l4n)^l Pg + ( ll/16)pj 


127i(3/47r) 2/3 

and 


16nDy cos 9 

11 QPi 

The set of Eqs. (94) and (95) are quite general in nature, and are applicable 
to bubble formation under constant pressure conditions, in the range of 
single bubbling. 

If the viscosity term is removed, then these equations reduce to the ones 
given earlier for in viscid fluids. Thus, these general equations are applicable 
to inviscid fluids, both with and without the surface tension at the orifice tip. 

Verification of the equation for viscous liquids is shown in Fig. 16, where 
the solid lines correspond to theoretically calculated values. 



Fig. 16. Comparison of the model (S3) with the data collected for bubble formation in 
viscous liquids under constant pressure conditions. 
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The range of variables in which the equations have been tested is given 
in Table VIII. The agreement in overall range is found to be good. 

TABLE VIII 


Range of Variables Covered for Testing the Model of Satyanarayan, 
Kumar, and Kuloor (S3) 


Orifice 

Surface 

Viscosity 

Density 

Flow rate 


diameter 

tension 

P 

Pi 

Q 

P 

D 

(cm) 

y 

(dyn/cm) 

(cp) 

(gm/cm 3 ) 

(cm 3 /sec) 

(gm/cm • sec 2 ) 

0.051-0.405 

35-72 

90-600 

0.990-1.24 

0-260 

500-25,000 


VI. Bubble Formation in Non-Newtonian Fluids 

The study of bubble formation in non-Newtonian fluids has not been re¬ 
ported in literature in spite of the great industrial uses of these fluids. Recently, 
Subramaniyan and Kumar (S16) have studied bubble formation under constant 
flow conditions in fluids following the Ostwald-de-Waele rheological model. 
The model of Kumar and Kuloor (K16, K18, K19) has been extended to take 
into consideration the drag variation caused by the complexity of the rheo¬ 
logical equation. 

For Ostwalde-de-Waele fluids, 

t = — {A:|[(1/2) A : A] 1 / 2 r 1 }A (96) 

The creeping flow of a sphere in a fluid following Eq. (96) has been ex¬ 
tensively studied (SI3, T2, Wl, W2). The drag coefficient is given by 

C D = 24 XjRe' (97) 

where Re' is the modified Reynolds number, defined as 

Re' ~ {2r) n p x v 2 ~ n jk (98) 

X n is a function of the flow index n. For Newtonian fluids, the value of X n 
is unity. 

The total drag on the sphere worked out through Eq. (97) becomes 

F 24nX " k 
D 2 n+ 1 r n ~ 2 v~ n 


( 99 ) 
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The force balance bubble volume equation, on the introduction of the 
drag term given by Eq. (99) becomes 


V fb A pg = nDy cos 6 + 


8 2 [ Pg + (i 

127r(3/47r) 2/3 


24X„ kQ n Vr b (3n ~ 2)l3 

+ 2 3 " +1 n" -1 (3/47t) <3B - 2)/3 (100) 

The values of n and k are evaluated from Capillary Viscometer data. 
The upper and lower bounds of X n are known (W2). The mean of the two 
bounds is used as the actual X n value for computational purposes. 

The final equation for the second stage now becomes 


~ + Y + AT-°' +i)t3 v n = B - CT~ 5/3 - DT- (3n+l)l3 - ET~ l 


( 101 ) 


where 


, _ 24 nX n k 

~ 2" +1 (3/47r) ( "- 2)/3 [p a + (ll/16)pje 

d (pj - 

[fig + (H/16)Pj]6 

C- g — 

127r(3/47i) 2/3 

_ 24XJ<Q^_ _ 

2 3n+1 7c"- 1 (3/47i) (3 "- 2)/3 [p 9 + (11/16)/?,] 
nDy cos 9 

£ = [/? 9 + (ll/16)/?,]Q 

Equation (101) is numercially solved by the Runge-Kutta method. The 
agreement between the experimental values and theory is seen from Fig. 17, 
where the solid lines indicate the values obtained by theory. The liquid used 
for the data presented in Fig. 17 had n — 0.617 and k ~ 0.0059. 

It is possible to frame equations for various models of non-Newtonian 
fluids by incorporating the adequate drag terms into (100) and (101), but 
the experimental data for verifying these equations are still not available. 
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Fig. 17. Comparison of the model (SI6) with the collected data for bubble formation 
in non-Newtonian liquids under constant flow conditions. 


VII. Bubble Formation in Gas Fluidized Beds 
A. Introduction 

The two-phase theory of aggregative fluidization considers the bed to be 
made up of two parts, viz. (i) a particulate phase wherein the gas flow rate 
corresponds to that required for incipient fluidization, and (ii) a bubble phase 
which conveys the extra gas through the bed in the form of bubbles. This theory, 
which is applicable up to flow rates corresponding to approximately 10£/ o , is 
supported by many investigators (B2, M6, R8, T3, Yl). The bubbles which 
are formed at the distributor rise through the bed, exchanging the fluidizing 
gas and undergoing coalescence, rupture, etc. It is necessary to determine the 
sizes of the bubbles produced at the distributor, as their size distribution is 
important in a study of bubble dynamics, and also the extent of transfer 
operations and chemical reaction occurring in the bed. In order to understand 
the mechanism of bubble formation, single nozzles are normally employed. 

The investigations of Davidson et al (D7) enable a theoretical analysis 
of the phenomenon of bubble formation in fluidized beds. They found that, 
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as far as the movement of the bubbles is concerned, the fluidized bed behaves 
like an inviscid fluid with zero surface tension. This has been confirmed by 
Bloore and Botterill (B5) who compared their results with those obtained by 
Calderbank (Cl) for an air-water system and concluded that there is a marked 
similarity in the behavior of the gas bubbles in liquid systems and in fluidized 
systems, although the actual mechanism of bubble formation in these two 
systems must be quite dissimilar. In these experiments the bubbles were 
formed by injecting a measured stream of gas through a nozzle placed in a 
gently fluidized bed of ballotini (0.8 mm diam.). 

The authors (B5) report that for flow rates between 50 and 250 cm 3 /sec, 
the bubble frequency remains essentially constant at 22 bubbles per second, 
whereas at lower flow rates, the frequency shows a marked tendency to rise. 

A more thorough experimental investigation was carried out by Harrison 
and Leung (H3). 

The conclusions drawn are as follows: 

1. The bed depth has no influence on the size of the bubble produced. 
This indicates that the bubbles are formed under either constant flow or con¬ 
stant pressure conditions. In the intermediate region, Padmavathy, Kumar, 
and Kuloor (PI) have shown that the bubble volume in an air-water system is 
highly sensitive to the variation in the depth of the liquid column above the 
bubble forming nozzle. As the bed has no surface tension, no variation of 
flow is expected during bubble formation, and the conditions of constant flow 
are approximated. This explanation is due to present authors. 

2. The particle size of the solids does not influence the bubble size. 

3. The bubble frequency is independent of the main fluidizing velocity. 

4. The bubble formation in fluidized beds follows a trend very similar 
to that for bubble formation in inviscid liquids. 

From the above discussion, it is seen that the bed acts as an inviscid fluid 
without surface tension, and that the bubble formation takes place under 
constant flow conditions. Thus, the theories which are applicable to the above 
conditions should also be applicable to bubble formation in fluidized beds. 

B. Model of Davidson and Harrison (D6) 

These authors have modified the model of Davidson and Schuler (D9). 
The modification is in the value of liquid being carried with the bubble, which 
is assumed to be 1/2 the bubble volume, instead of the earlier 11/16th the 
bubble volume. The resulting equation for the bubble volume is now written as 

v, = V' 5 ) (102) 

The authors have tested this equation with the data of Harrison and 
Leung (H3) and those of Bloore and Botterill (B5) and found a reasonably 
satisfactory agreement between the experimental and the calculated values. 
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C. Model of Kumar and Kuloor (K17) 


This model is a modification of the model developed by Kumar and 
Kuloor (K18) for bubble formation in inviscid fluids in the absence of surface- 
tension effects. The need for modification arises because the bubble forming 
nozzles actually used to collect data on bubble formation in fluidized beds 
differ from the orifice plates in that they do not have a flat base. Under such 
conditions the bubble must be assumed to be moving in an infinite medium 
and the value of 1/2 is more justified than the value 11/16. 

The force balance bubble volume at the end of the expansion stage is 
given by 


v fb = 


247t(3/47t) 2/3 ^J 


3/5 


(103) 


In Eq. (103) the value of 1/2 has been used for evaluating the virtual mass 
of the bubble. 

The final equation for the bubble volume, which results from the analysis 
of the second viz. the detachment stage, is 

(3/47t) 1/3 (F /fc ) 1/3 = (P/4Q)(F* - V 2 fb ) - (9NIQ)(V l F 13 - V'/ 3 ) 

+ (l/fi)[3 NV 1 / 3 - (P/2)F^](ln V F - In V fb ) (104) 

where 


p (Pi ~ P g )9 rj 2 g 
Q(Pg + ipl) Q 
and 


12rt(3/4re) 2/3 

The authors (K17) find that Eq. (104) explains the data of Bloore and 
Botterill (B5) and Harrison and Leung (H3) quite well. 

On introducing various simplifications, Eq. (104) becomes 

V F = 0.806 (105) 

Though both the above equations represent the data with reasonable 
accuracy the conditions present at the distributor agree better with the as¬ 
sumption that 11/16th of the volume of the liquid is being carried by the 
bubble. It is therefore recommended that, for calculation of bubble formation 
at the distributor, Eqs. (22) and (28) be used. 
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VIII. The Effect of Orifice Geometry on Bubble Size 

Circular orifices are normally employed in the study of bubble formation 
at submerged orifices because they admit an accurate assessment of the sur¬ 
face-tension effect. In industrial operations, however, the use of noncircular 
slots for bubble formation is quite frequent. A few investigators (C5, G2, J2, 
K4, S10, V4), who used porous solids for bubble formation have employed 
the average pore size for purposes of correlation. Spells and Bakowski (S15) 
and Cross and Rider (C9) have used rectangular slots for bubble formation 
in water, but only with a view to finding the influence of slot submergence on 
the air flow rate. Johnson et ah (J3) have indicated the influence of orifice 
geometry on bubble formation by observing that the bubbles produced through 
drilled orifices vary in size from those obtained through punched orifices. 

Krishnamurthi, Kumar, and Datta (K7) employed a circular orifice of 
arbitrarily chosen dimensions as the standard, and constructed two sets of 
noncircular orifices having either the perimeter or the area equal to that of 
the standard orifice. The configurations chosen were an equilateral triangle, 
a square, and a rectangle. The system used by these authors was air-water, 
and their studies were confined to extremely small flow rates (<0.5 cm 3 /sec). 
Their results indicate that noncircular orifices do not utilize their entire peri¬ 
meter for bubble formation, and, for equi-sided orifices at low frequencies of 
formation, the bubble is formed as if from a circle inscribed in the noncircular 
orifice. In this range, the perimeter and the area are important in determining 
the final bubble size. 

Further investigations by the above authors (K5), using an air-glycerine 
solution system (density = 1.22 g/ml, viscosity = 76.2cp and surface tension 
= 61.8 dyn/cm) confirmed the above conclusions for highly viscous liquids 
also. The authors have further correlated their results by the empirical 
relation (K8): 

(^)n. C . = (W) 0 ' 52 (106) 

where 

(V F ) n c = the bubble volume from the noncircular orifice, 

(V F )i = the bubble volume from the inscribed circle, and 

R — the ratio of the area of the noncircular orifice to that of the 
inscribed circle. 

Ramakrishnan, Kumar, and Kuloor (Rl) have extended the investigations 
of Krishnamurthi et ah (K5, K7, K8) to more geometries, much higher flow 
rates (up to 100 cm 3 /sec), liquids of still higher viscosity, and to completely 
constant flow conditions. 
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For both the standard orifices studied here, the equal-area orifices are 
found to give the same bubble volume versus Q curve as that of the standard 
orifice for all the regular equi-sided geometries and all the systems investi¬ 
gated. This fact permits the evaluation of the bubble volume from a non¬ 
circular orifice by using the equation for the circular orifice with an equivalent 
diameter. Thus it is not necessary to develop separate equations for each 
geometry. Figure 18 shows the results obtained for various geometries and 



Fig. 18. Effect of orifice geometry on bubble formation for orifices based on area 
equivalence. 

for the circular orifice based on area equivalence. For both the air-water and 
the air-glycerine systems = 288 cp) all the geometries yield single curves. 

Figure 19, which contains correspondingly results based on perimeter 
equivalence, shows that the various geometries yield different curves. The tri¬ 
angular orifice yields the lowest bubble volumes under a given set of condi¬ 
tions. The bubble volume increases as the number of sides of the orifice 
increases, the maximum being obtained from the circular orifice. The differ¬ 
ence in values is most evident at low flow rates. 

The difference arises from the fact that under constant flow conditions, 
the major function of the orifice is to determine the effect of the surface tension 
force. The change in velocity, which can be caused by changing the area of the 
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Fig. 19. Effect of orifice geometry on bubble formation for orifices based on perimeter 
equivalence. 

orifice, is of little importance in bubble formation, because of the negligible 
kinetic energy of the gas stream. For noncircular orifices of equal area, the 
total perimeter varies with the number of sides. The fewer the number of 
sides, the greater is the perimeter; thus, the equilateral triangle has the largest 
perimeter among all the geometries studied. The bubble, however, does not 
form at the edges, and the more acute the angle the larger is the section which 
does not participate in bubble formation. This again is maximum for the 
triangle. The remaining perimeter may be called “effective perimeter.” Thus, 
the unused perimeter decreases with increasing number of sides. The overall 
effect seems to be such that the effective perimeter remains the same. 

Noncircular orifices made on the basis of perimeter equivalence always 
have a smaller effective perimeter than the circular orifice. Since the deficiency 
is greater for geometries with fewer numbers of sides, the bubble volumes ob¬ 
tained from triangular orifices are the least. 

The influence of orifice geometry is closely associated with that of overall 
surface-tension force. From Sections IV and V, however, it is evident that 
the surface-tension effects are limited to small flow rates, the actual value of 
the flow rate where the effect becomes absent being dependent on viscosity 
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of liquid. Similar effects are evident from Fig. 19, where the various geom¬ 
etries yield distinct curves at low flow rates, which merge into a single curve 
at high flow rates. 

A comparison of Fig. 18 with Fig. 19 throws further light on the pheno¬ 
menon. The results obtained from area and perimeter equivalence are seen 
to be wide apart at low flow rates, but are the same at high flow rates. In 
general, the orifice geometry has no influence at high flow rates (> 80 cm 3 /sec), 
and the bubble volume can be evaluated by using the equations given in 
Section IV for situations where the surface tension is not important. For 
smaller flow rates, the general model that includes the surface-tension effect 
has to be employed, with the modification that for a noncircular orifice, the 
diameter must be replaced by the expression 

D e = (4T/;t) 1/2 (107) 

where D e is the effective diameter of a hypothetical equivalent circular orifice, 
and A is the area of the noncircular orifice. 

The modified equations apply quite well to the regular geometries, as is 
evident from Figs. 18 and 19, wherein the solid lines represent the values 
obtained from these equations. 

The situation may be different for irregular geometries like rectangle, etc., 
asisfoundbyKrishnamurthieJ ah (K7), who observe that the bubble volumes 
obtained for a rectangle made on the basis of area equivalence are higher than 
those obtained from a circular orifice. 

Some experiments conducted under constant pressure conditions indicate 
that the above equivalence does not hold for this situation. 


IX. The Influence of Orifice Orientation on Bubble Formation 
A. Introduction 

Almost all the results reported in literature pertain to studies with hori¬ 
zontally oriented orifices, and the discussions in the preceding sections are 
applicable only to such orifices. In industrial applications, however, the orifices 
are usually oriented at various angles, the most common (as in the case of the 
bubble plates) being n/2 from the horizontal. 

The first quantitative attempt (K12) in this direction was made with 
vertical orifices, under constant flow conditions. Here, the bubble formation 
is considered to be occurring in two distinct steps. In the first stage, the bubble 
is assumed to expand at the tip, moving vertically at the same time. As the 
bubble is formed at an angle to the vertical, a vertical component of the 
surface tension force will be operative during this stage. The first stage is 
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assumed to end when the buoyancy of the growing bubble just counter¬ 
balances the vertical component of the surface tension force. The bubble 
volume corresponding to this condition is denoted by V s . 

In the second stage, at any particular instant, the bubble is assumed to be 
moving with a Stokes terminal velocity corresponding to its size at that 
instant. Thus, 


dx ~ CV 2l3 (p l - p g ) 
dt p 


(108) 


The second stage is assumed to end when the base of the bubble has covered 
a distance equal to the orifice diameter. Thus, 


V F = 


CDy 


(Pi -Pg)9 


+ Kn 


(109) 


Integrating Eq. (108) within the limits of the orifice diameter, viz. from 
0 to D, and substituting Q t for V, we obtain 




pQD 


3 C(p, - p g ) 


+ V 5 J 3 


( 110 ) 


This equation, however, does not adequately account for the influence of 
viscosity, and was hence modified empirically to 

Vp 3 = 0.069 - QD(fiJfi) 0 - 81 + [ 1.22(W/O 0 ' 09 7 — . T~ \ ' (HI) 

( Pl-Pg) L (Pl~P g )9 J 

Equation (111) is reported to be applicable up to flow rates of about 0.05 
cm 3 /sec, for liquids having viscosity up to 185 cp. But apart from the approach 
being semiempirical, the flow rates studied are in too low a range to be of 
any practical importance. The problem is further complicated by the treat¬ 
ment of each angle as a separate case. 


B. Models of Kumar, Kuloor, and Co-workers 

Experimental data for much higher flow rates and for various orifice orien¬ 
tations have been collected by the above investigators, under both constant 
flow and constant pressure conditions. The data of the above workers, 
for liquids of different physical properties under constant flow conditions, 
show that for any definite set of conditions, the bubble size does not decrease 
continuously with increasing angle of orientation. The data for a viscous 
liquid are presented in Fig. 20. The orifice oriented at 15° yields higher bubble 
volumes than the one oriented horizontally. Similarly, the vertically oriented 
orifice yields higher bubble volumes than that oriented at 60°, under otherwise 
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Fig. 20. Effect of orientation of orifice on bubble formation under constant flow 
conditions. 

identical conditions. Further, the behavior varies with the flow rate and the 
viscosity of the liquid. 

These results are explained by extending the models of Kumar and 
Kuloor discussed earlier in Sections IV and V. At any orientation, the bubble 
is again considered to be formed in two distinct stages, but the conditions for 
the termination of the two stages are different in this case. 

During the first stage, the bubble initially expands perpendicular to the 
orifice, but later, both the expansion and the movement are essentially in the 
vertical direction for all the orifice orientations. The sequence of formation 
for three different orientations is evident from Fig. 21a,b. This sequence has 
been idealized in Fig. 21c. Obviously, it is difficult to obtain completely ver¬ 
tical expansion and ascent in the case of the vertical orifice if the expansion 
takes place exactly at the orifice tip. It is observed through high-speed photo¬ 
graphy that in this case the bubble neck extends slightly in the first stage itself 
and the surface-tension force is therefore not zero. The neck makes an angle 
of about 60° with the horizontal, and hence the surface tension influence for 
this case is the same as that for an orifice of 60° orientation. Thus the first 
stage remains essentially the same as that for horizontal orifices except that 
only a component of surface tension is now operative. This component is nDy 
cos <j) up to 60° and remains constant afterwards. 
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The first stage is assumed to end when the vertical components of the ex¬ 
pansion drag and the surface-tension force together become equal to the 
buoyancy. 

The second stage commences, and its characteristics in the present model 
are the same as before, except for the condition for detachment which occurs 
when the bubble has covered a distance equal to r fb cos 0 + D/2 sin 0. This 
is the most general case. For horizontal orientation, 0 = 0, and hence this 
distance reduces to r fb . 

Below are given the modified equations for both constant flow and con¬ 
stant pressure conditions. 


1. Constant Flow Conditions 


The modified equation for Vf> has to include the vertical components of 
the expansion drag and the surface-tension forces. Hence, for viscous fluids 
we have 


V%> = 


ii q 2 mvji 3 

1927r(3/47r) 2/3 0 2(3/4tt) 1/3 0A 


+ 



COS 0 


( 112 ) 


In the second stage, the bubble now covers a distance equal to (r fb 
cos 0 + D/2 sin 0). Thus, 


(r fb cos 0 + D/2 sin 0) = 




3D 


2Q(A - 1/3) 


(v* 2/3 ~ V}1 3 ) 


(113) 


where 

6(1.25);r(3/4^) 1/3 F}( 3 /i 
Q[p g + (ll/16) Pl ] 

B=(P,~ Pg)9lQ{pg + ^ P,) 

C = nDy cos <WQ^p tf + ^ p,j 
D = 3^i/2(3/47r) 1/3 (pff +Pi) 
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Fig. 21(a). Bubble formation at oriented orifices in viscous systems. 
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uimn 


Fig. 21(b). Bubble formation at oriented orifices in inviscid systems. 
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Expansion stage Detachment stage Condition of detachment 

Fig. 21(c). Idealized sequence of bubble formation at inclined orifices. 

The increase in (f) tends to reduce the first stage volume because of the 
smaller net downard force, whereas it increases the second stage volume be¬ 
cause of the distance covered by the bubble base. The final volume is the 
result of both these conflicting factors. 

This is very much marked for an orifice of diameter 0.60 cm, in an air- 
glycerol (fi — 288 cp) system, as shown in Fig. 20. At small flow rates, the 
bubble volume from vertical orifices are relatively small (though greater than 
those from 60°), but as the flow rate is increased the curve for this orifice 
crosses all the other curves, and virtually joins that for the horizontal orifice. 

The solid curves in Fig. 20 represent the values obtained by applying 
Eqs. (112) and (113). 




2. Constant Pressure Conditions 

The results obtained for these conditions exhibit the same kind of 
irregular behavior as observed for constant flow conditions. 

The model proposed for constant flow conditions is again used here to 
explain the data. The modified equations for inviscid fluids are: 


• Pg)9 = 


£ 2 [p g + (ll/16)p ; ] 
4n{3j4n) V3 


Vf b 2li \p + Pl gOI4ny ll V} ( 


2 r* 7 V /3 1 

(3/4;0 ,/3 J 


K 2 [p g + (ll/16)p,] 
327t 2 (3/47r) 2 


[4P(3/4jr) i/3 V/J 3 — 3p l g(3j4n) 2l3 V/ b /3 — lOy] + 2nRy cos (j> (114) 
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where 


(r fb cos (j> + D/2 sin </>) 

= (p'/mv?- vti - miQwy 3 - vy b 3 ) 

-VIQ)(V F -V fb ) + (1IQ) 

UV fb + 3 NV}> 3 - (P'l2)Vf b -\{\n V F - In V fb ) 


P' = (Pi~ P g )IQ[p9 + Y(> P ) 

J = nDy cos ^ 


N = e/127i(3/47i) 2/3 
Similarly, for viscous liquids: 

Up, - P,)9 - Ki 4* ( t/4!, 1 )C» 6), ’ |] Ui W, [p + P,9(3/4»)«U.' ! 

_ i#-,-, 1 ' 1 ] _ g>,+(ii;i 6)pi]K7»- 

^_ 2 /tm_x 2 [4F(3/47r) 


(3/47T) 1 


327r 2 (3/47r) 2 


- 3p/P(3/47r) 2/3 K// 3 - lOy] + 


3-KpF ^ 3 
2(3 14k) 1 ' 3 


r 2 vv _ 1/3 1 1/2 

•P + PiP(3/4^) 1/3 F / V 3 - ^i^y/2 + ZrcKy cos 0 


[»■/* cos <t> + (D/2)sin </>] = + (F f 2 - F 2 *) - (CMQ)(F f - F /fc ) 


where 


2Q(X - 1/3) 

6(1.25)tc(3/4tt) 1/3 f;/ 3 p 
Q[ p 9 + (H/16)pJ 

5 = (p i -p 9 )p/e(p 9 + ^p 1 ) 

C = rrDy cos ^/e(p fl + ^p i ) 


(F f 2/3 - F// 3 ) (117) 


D = 3p/2(3/4rc)' 
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The above equations have been tested by the authors for data 
collected over a wide range of variables (Q — 200 cm 3 /sec, up to 500 cp, 
D up to 0.40 cm) and found to be applicable. 


X. The Influence of Continuous Phase Velocity on Bubble Size 


Depending on its subtraction from or addition to the buoyancy force, the 
continuous phase velocity can either increase or decrease the bubble volume. 
Normally, this velocity is such that the bubble detaches prematurely from the 
nozzle tip. Maier (M2) has shown that the shear force experienced by the 
bubble, which causes its premature detachment, is a maximum when the 
continuous phase flows at right angles to the nozzle axis. 

Krishnamurthy et ah (K13) have confirmed the above conclusion and have 
developed an expression for evaluating the bubble volume under conditions 
of flow when the corresponding volume for static conditions is given. These 
authors used capillaries of different diameters ground at the tip as nozzles. 
The capillaries were arranged horizontally, and the fluid travelled vertically 
so as to add to the buoyancy. The liquid viscosity was varied from 1 to 30 cp 
and the surface tension from 62 to 70 dyn/cm. 

The results showed that for all nozzle diameters the decrease in bubble 
volume with an increase in the continuous phase velocity is nonlinear. Further, 
for any particular velocity, the reduction in the bubble volume is greater as 
the liquid viscosity is increased. 

The reduction in bubble volume due to the continuous phase velocity is 
attributed to an extra upward drag force, which adds to the buoyancy. Thus, 


f b = f; + f d (118) 

where 

FB ~ V F (Pl “ Pg)9 

and 

Fi-VXPi-pjg. 


F d was evaluated from the drag coefficient at bubble Reynolds number 
and the projected area of the bubble. As the Reynolds number varied from 2 
to 700, the drag coefficient C D was evaluated by the Schiller and Naumann (S4) 
equation: 

C D - (24/Re')(l + 0.15Re ,0 ‘ 687 ) (119) 


Expressing the various terms of Eq. (118) in terms of volume and simpli¬ 
fying, we obtain 

v; + — fiuv; 1/3 + 2.035(fi/p,) 0313 — v;°- S62 = v F 

9Pi 9 


9 


( 120 ) 
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Equation (120) permits the evaluation of V F from the values of V F and the 
other pertinent parameters. As an analytical solution of Eq. (120) is not pos¬ 
sible, numerical procedures have to be used. The equation can, however, be 
rendered cubic and solved analytically if the power of V F in the third term on 
the left-hand side is changed from 0.562 to 0.667. The error thus caused is 
found to be negligible (about 1 %). The equation then becomes 

V; + 112^ yji/3 + 2.035 (nlpi) 0 313 + -- Vf 13 = V F (121) 

9Pi a 

Equation (121) is solved by Cardon’s method and found to predict the 
data with an average deviation of ±6%. 

The above analysis assumes a constant downward force which is possible 
only when the surface tension is the main resisting force. This restricts the 
utility of Eq. (121) to cases where the gas flow rates employed are extremely 
small. 


XL Formation of Bubbles at Multiple Orificed Plates 

When the flow rates used are small or the interslot distances large, the 
bubble interaction does not take place and the bubbles are formed as if from 
single orifices. In this region the fractional hold-up increases with the in¬ 
creasing gas velocity because the bubble volume increases with the gas flow 
rate without corresponding change in the velocity of the rise of the bubble. 
The limit is reached when fractional gas hold-up reaches about 0.60. This 
corresponds to the condition of close packing of spheres. Under these con¬ 
ditions, the neighboring orifices restrict the bubble diameter in the horizontal 
direction, and the vertical deformation in bubbles occurs, if the diameter of 
the bubble (from a single orifice) is greater than the pitch of the perforation. 

As the gas flow rate is further increased, Calderbank and Rennie (C4) 
consider two possibilities. The bubbles may increase in size but deform into 
polyhedra to yield cellular foams containing very little liquid. Alternatively, 
the foam may break due to kinetic energy of the gas and bubbles may burst 
into much smaller units, resulting in froth. Higher vapor velocities (>15.2 
ft/sec) generate froth (Tl). Cellular foams are formed for low interslot dis¬ 
tances, low gas flows, low submergence, and small diameter holes, whereas 
froths are formed for high flow rates and submergences (C4, Tl). 

Calderbank and Rennie (C4) and Rennie and Evans (R5) have found 
Sauter mean bubble diameters both photographically and from foam den¬ 
sities by y-ray absorption technique. Their bubble size could be predicted by 
the equation of Leibson et ah (L2) with a frothing system, for orifice Reynolds 
numbers between 2000 and 10,000. Thus, 
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d = 0.713«e; OO5 cm (122) 

The hold-up can be evaluated by the relationship developed by Crozier 

(CIO): 

In —^ = 0.715r s (p g ) 1/2 + 0.45 (123) 

1 Hg 

where v s is the superficial gas velocity in feet per second, and p g is the gas 
density in pounds per cubic foot. 

By making use of Eqs. (122) and (123), the interfacial area a can be 
calculated by 

a = 6HJd (124) 

The work of Calderbank and Rennie (C4) has been criticized by Sargent 
and Macmillan (S2) on the basis that the liquid flow conditions used by 
Calderbank and Rennie (C4) are not found in distillation columns. They (S2) 
consider that cellular foams are formed for dilute aqueous solutions only 
when low gas flow rates are employed. By using an /i-pentane-isopentane 
system, Macmillan (Ml) found that for all gas flow rates, froths with densities 
less than 0.15 were formed and the froth densities were independent of the 
factor v s (p g ) 112 but dependent on tray geometry. The associated problem of 
foam stability has also attracted considerable attention (Al, D3, Zl). 


XII. Formation of Drops 

Drops, like bubbles, can be formed by nozzles, and also by microburettes, 
spinning disks, or swirl chambers. The various methods have been discussed 
by Batchelor and Davis (Bl) in their monograph. The simplest method of 
forming drops is from nozzles and is most commonly employed; the present 
review is restricted to this mode only. 

A. Influence of Variables on Drop Size 

When a dispersed phase is passed through a nozzle immersed in an im¬ 
miscible continuous phase, the most important variables influencing the re¬ 
sultant drop size are: the velocity of the dispersed phase, viscosity and density 
of continuous phase, and the density of the dispersed phase (G2, HI, H5, 
M3, Nl, P5, R3, S5). In general, an increase in continuous-phase viscosity, 
nozzle diameter, and interfacial tension increases the drop volume, whereas 
the increase in density difference results in its decrease. However, Null and 
Johnson (N4) do not find the influence of continuous-phase viscosity sig¬ 
nificant and exclude this variable from their analysis. Contradictory findings 
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are also reported regarding the influence of volumetric flow rate of the dis¬ 
persed phase. Hayworth and Treybal (H5) find that when drop volume is 
plotted versus volumetric flow rate, the curve has one maximum. Null and 
Johnson (N4), on the other hand, report two peaks in the V F versus Q curve. 
Kumar, Kuloor, and co-workers (K2, R3) find one maximum in the flow range 
studied by them. The maximum is shifted (R3) when the viscosity of the con¬ 
tinuous phase is increased and no maximum is observed when the continuous- 
phase viscosity is raised to quite high values (>5 poise). The maximum has 
also been found to be absent when the dispersed-phase viscosity is increased 
(K2), though it is possible that if higher flow rates are investigated, the maxi¬ 
mum would appear. 

The influence of dispersed-phase viscosity was found to be negligible by 
Hayworth and Treybal (H5), but found to be significant (K2) when a greater 
range of dispered-phase viscosity was investigated. From the graph of 
Hayworth and Treybal, the influence of interfacial tension appears, as in the 
case of bubbles, to be more at low flow rates than at high flow rates. 

At vanishingly small flow rates, the drop volume calculated by the equali¬ 
zation of the buoyancy with interfacial tension gives drop volumes higher 
than those observed experimentally, because of the residual drop. Harkins’ 
(H2) correction has therefore to be applied to calculated drop volumes under 
these conditions. Thus, 


V s ='f 1 mid) 
hpg 


(125) 


This correction has been incorporated in most of the models developed 
for drop formation. 


B. Hayworth and Treybal Model 

The model is based on the assumption that the drop detachment does not 
take place until the velocity v of the drop is larger than the velocity of the dis¬ 
persed phase in the nozzle, and also until the buoyancy force equals the force 
due to interfacial tension. 

Another force acting on the drop is due to the kinetic energy of the dis¬ 
persed-phase stream, which adds to the buoyancy force. Each of these forces 
has been considered to be contributing to the final drop volume. Thus, the 
final volume becomes 


V f =V s +V v -V k 


( 126 ) 
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where V v is the volume representing the force experienced by the moving drop, 
and V K is the negative volume resulting from buoyancy. The term V s is given 
by Eq. (125). The expressions for V v and V K are 


V. = 0.5236 


18 Kv'» c 


L (A p)g 


3/2 


where K is a correction to Stokes law, and 


V K = 


VjW? 
2 A pgd 


(127) 


(128) 


Substituting Eqs. (125), (127), and (128) in Eq. (126), we obtain 


V F + Vl /3 


0.403p d v' c 2 l 7i Dy 


9 Ap 




A pg 
+ 0.5236 


nm 

18 Kv’ c n c 


L kpg 


3/2 


(129) 


Equation (129) has been simplified to avoid trial and error procedure by 
taking i/ / (D/d) = 0.655 as a first approximation and expressing K empirically 
by 


4.05D 0 747 

^ — ../ 0 . 635 .. 0.814 (130) 

Vc re 


Introducing the above simplification in Eq. (129), we obtain 


V F 


+ 4.11(10 _4 )F| /3 


- pm 

- A p . 


21(10 _4 )(yD/Ap) 


+ 1.069(10 -2 ) 



(131) 


Equation (131) can be used for the evaluation of V F . The use of this equa¬ 
tion has been simplified by having a graphical solution, which represents the 
drop diameter graphically as a function of two variables, 


and 


J = 0.00411 


- P _m 

- Ap . 


H = 0.0021 



+ 0.01069 



The fi c here is in centipoise. 

The average deviation reported by the authors is 7.5%. 
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C. Null and Johnson Model 

These investigators have considered the drop formation from the view¬ 
point of the geometry of the forming drop, as actually observed with the aid 
of stroboscopic light. Two steps of drop formation, viz. the growth at the tip 
and the detachment stage, have been recognized. Before breakage the drop is 
considered as a sphere atop, and tangent to, a right truncated cone passing 
through the circumference of the nozzle. 

The final graphical correlation given by the authors is in terms of the 
following two groups: 



A prediction accuracy of 10 to 30% is claimed. 

D. Models of Kumar, Kuloor, and Co-workers 

1. Model of Rao , Kumar , and Kuloor (i?3) 

This model considers the drop formation to take place in two stages, the 
expansion stage when the drop inflates at the nozzle tip and the detachment 
stage when the drop rises, forms a neck, and finally gets detached from the 
nozzle. The first stage is assumed to end when the buoyancy becomes equal 
to the interfacial tension force. For the termination of the second stage two 
conditions have been used, which result in two values of time of detachment. 
The lower of the two values is employed for calculation. 

The static drop volume is given by the equation 

2(132) 
(Pc — Pi)9 

For evaluation of t c , two models have been proposed. 

where </>(i?/F 1/3 ) is a correction factor suggested by Harkins and Brown (H2). 

When the static drop stage is passed, the drop starts rising with varying 
velocity, but still maintaining its connection with the nozzle through a neck. 
As further liquid is pumped into the drop during the time of detachment t c 
also, the final drop volume becomes 

Fp — F Qt c 033) 

For evaluation of t c9 two models have been proposed. 

a . Model L At the time of release, the detaching drop leaves behind a 
hemispherical residual drop. This model considers that the drop detachment 
occurs when the rate of growth of the hemisphere is equal to the velocity of 
the moving drop. 
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The rate of growth of the hemisphere is found to be 

dr/dt = \v' c (134) 

where v' c is the velocity of the dispersed phase in the nozzle. 

The movement of the drop in the second stage is governed by the equation 
of motion. Thus, 

d(Mv)/dt = Q(p c - p d )gt + Qv c p d - 6nrpv (135) 

Here, the virtual mass M of the drop can be assumed to be constant and 
equal to that of the static drop, because there is not much growth before 
detachment. The mass term is written as 

M=V sPd + ( U/l6)V sPc (136) 

where (ll/16)F s p c is the correction which accounts for the inertia of the con¬ 
tinuous phase and which assumes the flow around the drop to be irrotational 
and unseparated, as suggested by Davidson and Schuler (D9). The equation 
of motion now becomes 

M(dv/dt) = Q(p c - p d )gt + Qv c p d - 6nrpv (137) 

The above equation is a first-order linear differential equation which on 
solution gives 

- C A +B {i-h) + {?-$) Ca «- A,) (138) 

where 

A = 6nrplM 
B = Q(p c - p d )g/M 
C = Qv’cpJM 

The integration constant for Eq. (138) is calculated by imposing the con¬ 
dition v = 0, when t = 0. At detachment, the condition v = (l/2)t>' is applied 
and the corresponding time of detachment (t = t c ) is evaluated. This model is 
useful only when the distance between the detachment drop and the residual 
drop is negligible. 

b. Model II. This model is applicable to highly viscous continuous phases. 
This model assumes necking to take place before final detachment. The 
length of the neck is equal to the diameter of the static drop. Thus the time 
of detachment is that required by the ascending drop to cover a distance equal 
to the diameter of the static drop. 

For viscous liquids, all the terms except B/A of Eq. (138) are small and 
can be neglected. Thus, 


v = dxjdt = BtjA 


(139) 
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Equation (139) is solved with the boundary condition: 

x = 0 at t - 0 

The time of detachment is evaluated from the condition x = d s when 
t = t c . Thus, 

t c ~(2Ad s /Byt 2 (140) 

The final volume is evaluated by making use of Eqs. (132), (133), and (140). 


2. Inclusion of Dispersed Phase Viscosity 


Kalyanasundaram, Kumar, and Kuloor (K2) found the influence of dis¬ 
persed phase viscosity on drop formation to be quite appreciable at high rates 
of flow. The increase in p d results in an increase in drop volume. To account 
for this, the earlier model was modified by adding an extra resisting force due 
to the tensile viscosity of the dispersed phase. The tensile viscosity is taken as 
thrice the shear viscosity of the dispersed phase, in analogy with the extension 
of an elastic strip where the tensile elastic modulus is represented by thrice 
the shear elastic modulus for an incompressible material. The actual force 
resulting from the above is given by 3nRfi d v. 

The first stage is evaluated by 

V s = (2nRy)/(p c - p d )g (141) 

The equation governing the second stage becomes 

M(dv/dt) = Q(p c - p d )gt + Qv c p d - ( 6nr s p + 2nRp d )v (142) 

Expressing v as dx/dt and solving the equation with the boundary con¬ 
ditions at t = 0, v = 0, and x = 0, and putting t = t c for x = d s . 


where 



(143) 


A = (6nr s p)/M 
B = (QApg)IM 
C — (Qv c p d )/M 


Equation (143) permits the evaluation of the time of detachment t c by 
trial and error procedure. 

The final drop volume can then be obtained by using Eq. (133). 
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E. Comparison of Various Models 

The equation of Hayworth and Treybal (H5) is semi-empirical and is 
based on a force balance made by expressing the various contributing forces 
acting on the drop as fractions of the total drop volume. This procedure is 
probably not wholly justified, since the exact instant at which the forces act 
is not known, nor is their quantitative contribution to the total volume. The 
model also neglects the influence of dispersed-phase viscosity. 

The model of Null and Johnson (N4) neglects the viscosity of the con¬ 
tinuous phase which has been found to be effective (K2). 

Null and Johnson (N4) compared their experimental data with the avail¬ 
able models and found a maximum deviation of 377 % with the model de¬ 
veloped by Hayworth and Treybal (H5) and 94% with their own model. 



Fig. 22. Effect of viscosity of continuous phase on drop volume. 
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Volumetric flow rate(cm3/sec) - +* 

Fig. 23. Effect of volumetric flow rate on drop volume. 



Interfacial tension (dynes/cm) - 

Fig. 24. Effect of interfacial tension on drop volume. 
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Rao et ah (R3) have compared their experimental results with various 
models regarding the prediction of the influence of various parameters on 
drop volume. Their findings are presented in Figs. 22-24. Their model seems 
to predict the influence of viscosity of the continuous phase and volumetric 
flow rate on the drop volume better than the other models, but there is con¬ 
siderable deviation for interfacial-tension effect. The interfacial-tension change 
was brought about by these investigators by the addition of the surface active 
agent Lissapol; the surface active agents have a tendency to concentrate at 
the surface rather than in the bulk (C8). Thus the interfacial tension value 
measured and used by the authors in their calculations may have been much 
lower than the one operating during drop formation in the liquid bulk. 

However, the model of Rao et ah (R3) does not consider the influence of 
dispersed-phase viscosity. Further, the maximum size of the drop is limited 
to static drop size, which is true only for low flow rates. 

The improvement in this model (K2) takes the dispersed-phase viscosity 
into consideration and predicts better than the earlier models for situations 
when the dispersed phase is viscous. A typical set of values is shown in Fig. 25, 
from which it is seen that the model predicts better results in high flow range 
only. At lower flow rates, the predicted values are higher because the drop 
detaches at the nozzle tip itself and the application of Harkins and Brown’s 
(H2) correction becomes important, which has been neglected in the model. 



0.0 0.10 0.20 00 0.10 0.20 
Volumetric flow rate (cmVsec) 


Fig. 25. Effect of viscosity of dispersed phase on drop volume. 
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The verification of these models at higher flow rates has not been made. 
Hence they can be applied only in the ranges of conditions at which they have 
been tested. However, the model of Hayworth and Treybal (H5) has been 
found reasonably applicable for drop formation during spraying in the ab¬ 
sence of mass transfer. 


XIII. Drop Formation in Non-Newtonian Fluids 

Though most of the industrial fluids show non-Newtonian characteristics, 
the drop formation studies in them have not been reported. The results will 
very strongly depend on whether the non-Newtonian fluid forms the dis¬ 
persed or continuous phase. 

A. Drop Formation in Power-Law Fluids 

Experiments have recently been conducted by Saradhy and Kumar (SI) 
for drop formation of benzene in a C.M.C. 1 solution which followed the 



Fig. 26. Comparison of the model (SI) with the collected data for drop formation in 
non-Newtonian liquids. 


*Carboxy Methyl Cellulose. 
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power-law model. The results follow drop-volume versus flow-rate curves 
which pass through maximum, and are similar to the ones obtained for New¬ 
tonian fluids (R3). Figure 26 presents a set of data for a solution having 
n = 0.617 and k — 0.0059, which shows a maximum for each of the three 
nozzles studied. The unified model (K14) has been extended to explain the 
data collected. The first stage of drop formation remains the same except for 
the modification of the drag term. The second stage must consider the equa¬ 
tion of motion of a spherical bubble in a power-law fluid. This has been 
studied extensively (SI3, T2, Wl, W2). The drag coefficient is given by 


C D = 24 XJN' Re (144) 

where X n is a function of flow index n, and N' Re is the modified Reynolds 
number, defined as 

N' Re = (2r) n pv 2 ~ n /k (145) 

X n is not known rigorously, but its upper and lower bounds have been 
calculated (W2). 

From Eqs. (144) and (145) and the definition of C D , the downward resisting 
force becomes 


F d = (247iX n kv n )l2 n + 1 r n ' 2 ) 


(146) 


Introducing the above drag expression Eq. (146) in the force balance 
particle equation of unified model (K14) ,we obtain 


VfbiPc - Pi)9 - 


e 2 [p, + (ll/16)p c ]F / -^ 3 

127r(3/47i) 2/3 


-b nDy cos 9 


24 X n kQ\V fb y (3n - 2)/3 

+ 2 3n+X 7I »-l(3/4 7I ) ( 3 n -2) /3 


-Qp d v' c (147) 

Equation (147) neglects the influence of dispersed-phase viscosity by con¬ 
sidering the dispersed phase to be inviscid, which is true for benzene. 

The equation for detachment stage is 


dv/dT + X(t>7T ( " +1)/3 ) + v[T= B -£>T' (3n+1)/3 -CJ" 1 - £T" 5/3 (148) 
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where 

v = Q(dx/dT) 

T=V fb + Qt 

24nX„k 

2 n+ 1 (3/4n) (n ~ 2)/3 [p d + (U/16)p c ]Q 

B = ~ Pd)g 

GIA + ( ll/16)pj 


QLPi + (H/I6)p c ] 

0 = _ 24X n kQ"- 1 _ 

2 3n+l n n -\3l4n) i3n - 2)/3 lp i + (ll/16)pj 

E = nM^p ;Z = - a,,A + nDy me 

Equation (148) is solved with boundary conditions 


t = 0, dx/dT = 0, r = and x = 0 


The volume is the value of T for x = . This is obtained numerically 

by using the Runge-Kutta method. The results are shown as dotted lines in 
Fig. 26, where they show excellent agreement in the range investigated. For 
calculation purposes, the value of X n has been used as the mean of the upper 
and the lower bounds found by Wasserman and Slattery (W2). 

Use has also been made of the other model (R3) to explain the results. 
The calculated values through this model are also shown in Fig. 26. The 
model gives very poor agreement. 


B. Power-Law Drops in Newtonian Fluid 

For this case, the first stage remains the same as before, except for the 
inclusion of the dispersed-phase viscosity, but the equation of motion for the 
second stage is changed to 

dv/dT +A(v/T) = B - DT“ 4/3 - CT" 1 - ET~ 513 - Fv n (149) 
where B , C, and E remain the same as in Eq. (148) and A , D , and F are 

i al4 

iPt+imvpjQ 

__6t irpv f __ 

[Pd + (H/16)p c ]Q 
F = 3knR 2 ~ n 
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This equation is solved with the same boundary conditions as Eq. (148) 
and the time of detachment is calculated. 

C. Power-Law Drops in Power-Law Fluids 

The equation governing the second stage for this situation becomes 

dv/dT + A ” 1)/3 + ^=B- £>r-(3»+D/3 _ C T~ l - ET~ 5 ' 3 - Fv” 

T T (150) 

where 

F — 3knR 2 ~ n 

This equation too is solved with the same boundary conditions as Eq. (148). 
A series of equations results when different combinations of fluids are 
used. There is no change for the first stage. All the terms of equation of 
motion remain the same except the force terms arising out of dispersed-phase 
and continuous-phase viscosities. The main information required for formu¬ 
lating the equations is the drag during the non-Newtonian flow around a 
sphere, which is available for a number of non-Newtonian models (A3, C6, 
FI, SI 3, SI 4, T2, W2). Drop formation in fluids of most of the non-Newtonian 
models still remains to be studied, so that whether the types of equations 
mentioned above can be applied to all the situations cannot now be 
determined. 


XIV. Drop Formation from Vertically Oriented Orifices 


In the absence of surface tension influences, the drop formation at vertical 
orifices is expressed by the equation given for bubble formation. The force 
due to kinetic energy of the liquid is neglected as its component is zero in the 
vertical direction. The drop ascends right from the beginning according to the 
equation of motion and detaches when it has covered a distance equal to the 
diameter of the nozzle. 

For a viscous liquid the equation of motion is 

d(Mv)/dt = (A p)gQt — 6nrpv (151) 


Equation (151) reduces to the inviscid case when p is put equal to zero. 
Expressing v in terms of the distance covered, 

0 d/dt)lM(dx/dt )] - (Ap)gQt - 6(3/4ny f3 pQ 1/3 t 1/3 (dx/dt) (152) 


This equation, when solved between x = 0 to D for t- 0 to t c , yields 


D = A 


31 5/3 5 * 4/3 20 1 


5 B 4B 2+ 9B 3 18B 4 ' 27B 5 


60r 2/3 + 120< 1/3 120 ln(3B? I/3 )] 


81S 6 


(153) 
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where 

a = iEi z 

I '.Pi + (H/l 6)pJ 

and 

6w(3/47t) t/3 /iG~ 2/3 

[^ + (ll/16)p c ] 

Final drop volume V F = £)t. 

For the inviscid case, the final equation is 

t = {2DIA) l{1 (154) 

where 

A (Pc - Pd)g 

[Pd + (H/16)p c ] 

Final drop volume = Qt. 

No data are available for other orientations, but equations similar to those 
for bubble formation, taking the appropriate component of the force due to 
kinetic energy, will be applicable. 

When surface-tension effect is significant, a component of this force be¬ 
comes operative which has not been theoretically calculated (K12). The drop 
now has a static volume-value which must be added to the volume computed 
through equation of motion with appropriate boundary conditions. These 
equations have been found to be applicable at small flow rates by Shankar 
Srinivas and Kumar (S7). 


XV. Spraying and Atomization 
A. Liquid-Liquid Systems 

Considerable attention has been paid in literature to the formation of 
liquid sprays in liquids and gases, through the disintegration of jets. The 
liquid-liquid systems have been studied mainly with a view to understanding 
the operation of spray extraction towers. Attempts to measure the drop sizes 
in liquid-liquid sprays have been made both at lower and higher flow rates 
(A2, J4, K3, L3, S8, W3). At low flow velocities through the nozzles, the data 
on drop size in the absence of mass transfer (L3, S8, W3) can be predicted 
to a rough approximation by the model of Hayworth and Treybal (H5) dis¬ 
cussed in Section XII,B. Other models have not as yet been tested for spraying. 
The main factors influencing the drop sizes produced are: the nozzle diameter, 
volumetric flow rate of the dispersed phase, and the system properties. 
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When the flow rate is made exceedingly large, flooding is observed 
(B4, M5) when the whole column gets filled by closely packed spheres, and 
further increase in flow rate results in the rejection of the dispersed phase at 
the entrance region itself. In the vicinity of these conditions, the drop forma¬ 
tion mechanism is bound to be influenced by the presence of dispersed phase. 
The conditions and the theory of flooding in the spray column have been 
worked out (M5), but apply only in the absence of mass transfer. From study¬ 
ing a wide range of flow rates, Keith and Hixson (K3) point out the various 
stages of jet formation. 

At vanishingly small flow rates, the drops get detached at the nozzle tip. 
As the flow rate is increased, the jetting point is reached where a very short 
continuous neck of liquid exists between the nozzle tip and the point of 
detachment. Further increase in the flow rate lengthens the jet, whereas still 
further increase to the critical velocity gives the jet a ruffled appearance. 
Beyond this region, the jet recedes to the nozzle tip. 

Below the jetting and the varicose point the drops obtained are of uniform 
size. Near the critical region, the nonuniformity is a maximum, whereas at 
still higher velocities the sizes become slightly more uniform, producing 
smaller drops. Near the critical region, a number of large drops appear which 
terminate the constant increase of surface area found below this region. Due 
to this phenomenon, an optimum flow rate is observed which gives the 
maximum area for unit volume of the dispersed phase. 

The uniformity of the particles as measured by the standard deviation for 
a normal probability curve was found to be a function of flow rate, nozzle 
size (better for smaller nozzles), nozzle length (decrease in nozzle length de¬ 
creases uniformity), etc. A decrease in interfacial tension is insufficient to 
cause change in uniformity. 

The range of dispersed-phase velocity studied by Keith and Hixson (K3) 
is from 10 to 30 cm/sec which, according to those authors, is of industrial 
interest. The results obtained by them in the absence of mass transfer can be 
predicted roughly by extrapolation of the Hayworth and Treybal correlation. 
In the presence of mass transfer, the results obtained (F2), the drop size dis¬ 
tribution, flooding, etc. are different from those observed in the absence of 
mass transfer. There is no reliable theory at present which can predict the 
drop size distribution in sprays, though rough approximations are possible 
when mass transfer is completely avoided. 

B. Liquid-Gas Systems 

A number of techniques have been evolved to disperse liquids in gases in 
the form of fine droplets. The various atomizing techniques are jet injections, 
fan sprays, centrifugal nozzles, twin fluid atomizers, impinging jets, and rotary 
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atomizers. Only the jet injections system is discussed here on the ground that 
the mechanism of its working is allied to the rest of this study. In the case of 
centrifugal nozzles and rotary atomizers, the process of drop formation is 
more due to the disintegration of liquid sheets than to the detachment of 
liquid drops from a liquid column. Spraying results in particle diameters pre¬ 
dominantly between 100 and 1000 microns, whereas atomization results in 
droplets of 100 microns diameter and smaller (01). The various factors influenc¬ 
ing liquid disintegration are: the liquid properties and flow rate, pressure, 
nozzle design, and the condition of the gas into which the liquid is injected. 
The problem of jet breakage leading to droplet formation has been considerably 
studied since the work of Rayleigh (R4). Though the main force participating 
in jet disintegration is the surface tension, other system properties like the 
liquid density, its viscosity, and its velocity in the gas also contribute to the 
disintegration process. 

Small disturbance causes the formation of crests and troughs on the jet, 
which intensify as long as there is a velocity difference between the gas and 
the wave surfaces. This can result in finally disintegrating the jet. The jet itself 
has turbulent eddies which contribute to jet disintegration. This factor has 
greater importance at high flow rates, so much so that a series of liquid liga¬ 
ments form, which break into small droplets (Dl, D2). Satellite droplets of 
1 to 2 microns diameter are formed along with the main droplets. A further 
complicating factor is the secondary breakup of droplets already formed. The 
criterion for secondary breakup is found experimentally as p g v)fy = constant 
(L4), the maximum stable droplet size being r st = 8 yl(p g v])\ when the air is 
also in motion, the mechanism of disintegration is different (W4). 

Because of such factors as wave formation, jet turbulence, and secondary 
breakup, the drops formed are not of uniform size. Various ways of describing 
the distribution, including the methods of Rosin and Rammler (R9) and of 
Nukiyama and Tanasawa (N3), are discussed by Mugele and Evans (M7). A 
completely theoretical prediction of the drop-size distribution resulting from 
the complex phenomena discussed has not yet been obtained. However, for 
simple jets issuing in still air, the following approximate relation has been 
suggested (P3): 


d L = 



(155) 


where d L is the linear mean droplet diameter and p is the pressure of the 
liquid. 

The other kinds of atomizers have been extensively studied and the avail¬ 
able literature has recently been compiled (Ol). 
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XVI. Unified Model for Drop and Bubble Formation 


Though the topics of bubble and drop formation have been treated sepa¬ 
rately by almost all the investigators, the two phenomena are quite similar 
as far as the influence of most of the variables is concerned. Even in the for¬ 
mation observed visually, the expansion and the detachment stages are simi¬ 
lar. The main differences between the two arise from such properties as den¬ 
sity, viscosity, and velocity of the dispersed phase, which cannot be neglected 
during the drop formation but are unimportant during bubble formation. 
Further, for vanishingly small flow rates, the drop formation must take the 
residual drop into consideration. Kumar (K14) has extended the bubble 
formation concepts of Kumar and Kuloor (K16, K18, K19, Rl) for constant 
flow conditions to take the above factors into consideration. The velocity of 
the dispersed phase through the nozzle has been considered to yield a kinetic 
force which adds to the buoyancy as in drop formation (R3). The dispersed- 
phase viscosity is considered to add a tensile force which resists detachment 
(K2). 

The drop formation is considered to proceed exactly in the same fashion 
as the bubble formation under constant flow conditions, viz. the two step 
(the expansion and detachment) mechanism. The tensile force does not arise 
in the expansion stage because there is no neck formation. 

The first-stage equation can now be written as 


V f b(Pc ” Pd)9 = 
(I) 


Q 2 [p, + (ll/16)p f ]^ 3 
127t(3/47r) 2/3 

(II) 


3 fiQV7„ 113 

2(3/4tt) 1/3 

(HI) 


+ 7 iDy cos 9 — 

nR 2 

(IV) (V) 


(156) 


Equation (156) is applicable for both bubble and drop formation. Thus, when 
term (V) is negligible, it describes bubble formation for all fluids when surface 
tension is important. On further removal of term (IV), it applies for viscous 
liquids in the absence of surface tension. If only terms (I) and (II) are retained, 
the equation applies to the inviscid case without surface tension. 

The equation of motion for the second stage is modified by the inclusion 
of both the kinetic force and the tensile force due to the dispersed-phase 
viscosity. 

The basic equation now becomes 

dv/dT + A(v/T) = B- Z)T~ 4/3 - CT _I - £T~ S/3 


(157) 
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where 

Q[p d + (U/I6)pj 2Q[p d + Ull6)pJ 
B _ (Pc-Pi) 

ei>i+(ii/i6)p«] 

c __ ^Dy cos 9 _ 4Qp d _ 

~ QlP* + (11/16)p e ] + (ll/16)p c ] 


2(3/4jt) 1/3 [p (j + (ll/16)p c ] 


12ti(3/47e) 2/3 

Equation (157) also reduces to various special cases of bubble formation 
when particular terms become insignificant. 

When the drop detachment takes place farther from the nozzle, the 
surface-tension term in Eq. (157) must be dropped. 

The final solution with the same condition of detachment (distance 
covered by the base of the bubble or drop equals r fb ) becomes 

>> = 2 


2Q(A - 1/3) 

1 / v f a+1 
Q\-A+l 


(F| /3 - V}' 3 ) ■ 


Q(A - 2/3) 


y-A+1 

V fb 


0'J 1 - t'n ) 


\l B C D , 

LUTi v, ‘ A y » a^Ti 5 v » - T= 2/3 JJ 


Equations (156) and (158) are applicable to bubble formation under 
constant flow conditions because all the pertinent equations can be obtained 
from Eqs. (156) and (158). 

The equations have also been tested for drop formation under arbitrarily 
chosen conditions by making use of available data (Kl, R2). The results are 
given in Table IX for comparison. The corresponding values calculated by 



TABLE IX 

Comparison of Drop Volume Data Available in Literature with Those Calculated by the Unified Model 


Drop volume (cm 3 ) 


System 

D 

(cm) 

P 

(poise) 

Pd 

(poise) 

r 

(dyn/cm) 

Pc 

(gm/cm 3 ) 

pa 

(gm/cm 3 ) 

Q 

(cm 3 /sec) 

Experimental 

Calculated 





1. West et al. (W3) 




Benzene-water 

0.162 

0.009 

0.006 

32.2 

0.9968 

0.87 

0.064 

0.0952 

0.1566 








0.087 

0.0867 

0.1599 








0.247 

0.0692 

0.1724 





2. Null and Johnson (N4) 




Benzene-water 

0.252 

0.0101 

0.0065 

28.9 

0.9972 

0.8639 

0.1472 

0.1738 

0.2211 








0.4370 

0.1683 

0.2552 








0.6460 

0.1343 

0.2499 








0.5310 

0.1518 

0.2559 





3. Hayworth and Treybal (H5) 




Benzene, alka- 

0.472 

0.0101 

0.0065 

10.9 

0.9982 

0.8788 

0.0910 

0.1954 

0.1711 

terge-water 







0.4900 

0.2680 

0.2391 








1.7500 

0.3330 

0.2279 

Benzene, alka- 

0.155 

0.0101 

0.0065 

10.9 

0.9982 

0.8788 

0.0155 

0.0610 

0.0514 

terge-water 







0.0624 

0.0919 

0.0599 








0.2457 

0.1021 

0.0571 

Benzene-water 

0.472 

0.0101 

0.0065 

25.7 

0.9982 

0.8788 

0.1435 

0.3330 

0.3899 








0.4550 

0.4076 

0.4632 








1.1375 

0.4632 

0.5456 

Benzene, alka- 

0.472 

0.0101 

0.0065 

3.3 

0.9982 

0.8788 

0.0770 

0.0871 

0.0599 

terge-water 







0.5600 

0.2209 

0.1081 



4. Rao et al (R3) 


Benzene-water 

0.243 

0.009 

0.0059 

32.2 0.9968 

0.87 

0.0224 

0.1750 

0.2121 







0.0950 

0.2050 

0.2348 







0.3650 

0.2190 

0.2739 







0.5700 

0.1890 

0.2749 

Benzene-water 

0.110 

0.009 

0.0059 

32.2 0.9968 

0.87 

0.0119 

0.0633 

0.0961 







0.0548 

0.0842 

0.1060 







0.0975 

0.0864 

0.1099 







0.1820 

0.0691 

0.1044 

Benzene- 

0.243 

4.77 

0.0059 

20.7 1.2510 

0.87 

0.0061 

0.0371 

0.0568 

glycerine 






0.0499 

0.0637 

0.0858 







0.1120 

0.0882 

0.1106 







0.3380 

0.1525 

0.1734 





5. Kalyanasundaram (Kl) 




Glycerol solution 

0.160 

0.0090 

0.1536 

27.05 1.590 

1.184 

0.005 

0.0280 

0.0362 

carbon tetra¬ 






0.010 

0.0290 

0.0376 

chloride 






0.030 

0.0328 

0.0405 







0.050 

0.0362 

0.0435 







0.100 

0.0443 

0.0482 







0.200 

0.0575 

0.0523 

Glycerol solution 

0.220 

0.0090 

0.1536 

27.05 1.590 

1.184 

0.005 

0.0412 

0.0493 

carbon tetra¬ 






0.010 

0.0427 

0.0506 

chloride 






0.030 

0.0478 

0.0547 







0.050 

0.0520 

0.0579 







0.100 

0.0597 

0.0640 







0.200 

0.0698 

0.0719 
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the individual models developed by the earlier investigators are also given 
(K2, R3). It is seen that this model gives values which are as good as the earlier 
models but has the advantage of being general. The model cannot, however, 
be applied to drop formation at very low flow rates because the Harkins 
correction, which has been neglected in Kumar’s analysis, then becomes 
important. 


XVII. Bubble and Drop Size in Stirred Vessels 

In stirred vessels the formation and maintenance of dispersions involves a 
balance of disintegration and coalescence of the dispersed-phase particles in a 
continuous phase that is typically in turbulent flow. The disintegration of a 
globule can occur either by viscous-shear forces or by turbulent pressure 
fluctuations. Clay (C7) has proposed turbulence as the primary cause of 
breakage. The various kinds of globule deformation resulting in final breakage 
have been discussed by Hinze (H6) who expresses the forces controlling defor¬ 
mation and breakup in terms of a Weber group and a viscosity group. 
Breakup occurs when the Weber group exceeds a critical value which, in turn, 
depends upon the mechanism of globule breakage. For a dispersion in tur¬ 
bulent flow the critical Weber group is nearly unity and the corresponding 
value of viscosity group is small. Calderbank (C2) has further elaborated the 
work of Hinze (H6) and shown that viscous forces may be ignored in the 
dispersions of gases and liquids. Shinnar and Church (S9), while discussing 
the particle size of agitated dispersions on the basis of Kolmogoroff’s postu¬ 
late of local isotropy, conclude that the gas bubbles do not become unstable 
due to oscillation of the globule as a whole, and the breakup of the bubbles is 
caused by the breakage of the surface layer by viscous shear. 

Under noncoalescing conditions, the size of gas bubbles in gas-liquid 
dispersions can be obtained by balancing the surface-tension forces with those 
due to turbulent fluctuations, which results in (C3, H3): 

rfm = /c [(^/G)°V 2 ] (159) 

where d m is the diameter of maximum surviving bubble in centimeters, y is 
the interfacial tension, W is the horse power dissipated, and G is the volume 
in cubic feet. 

For a homogeneous and isotropic turbulent field, Hinze (H6) has shown 
that k in Eq. (159) equals [(l/2)(AV<? c ,.)] 3/5 > where N Wecr is the critical Weber 
number of maximum bubble size capable of survival. 

Detailed investigations of dispersions in agitated and baffled vessels have 
been made by Vermeulen, Williams, and Langlois (V3), Rodger, Trice, and 
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Rushton (R6), Rodriguez, Grotz, and Engle (R7), and by Calderbank (C3). 
Vermeulen et ah (V3) have recognized the physical properties of the two fluids, 
volume fraction of the dispersed phase, stirring speed, and factors associated 
with the agitator geometry as the pertinent variables influencing the particle 
diameter in the agitator. They have made a detailed study of the first three 
sets of variables. The influence of the agitator geometry and other mechanical 
factors has been studied by Rodriguez et ah (R7). Vermeulen et ah (V3) 
report that an increase in agitator speed reduces the particle diameter both for 
liquid-liquid and gas-liquid dispersions; the decrease in interfacial tension 
decreases particle size; the viscosity of the continuous phase has no influence 
for liquid-liquid systems but significant effect for gas-liquid systems where 
the increase in viscosity increases the bubble size; the dispersed-phase vis¬ 
cosity increases the drop size but its effect is inconclusive in gas-liquid systems 
because of the nearly equal viscosities of the gases used; the density differences 
are not important in liquid-liquid systems and the gas density is not important 
in gas-liquid systems; the mean diameter of the particle decreases with de¬ 
crease in volume fractions of the dispersed phase both for liquid-liquid and 
gas-liquid systems. Further, the authors find that for some liquid-liquid and 
all the gas-liquid systems there is an increase in the particle diameter at points 
away from the impeller tip. Calderbank (C3) observes that definite distribu¬ 
tions exist for interfacial area both in vertical and radial directions. The 
influence of various pertinent groups has been discussed by Rodger et ah (R6). 

As completely theoretical analysis is not available for such systems, em¬ 
pirically found correlations involving pertinent dimensional groups have been 
proposed. Vermeulen et ah (V3) have listed ten sets of dimensionless variables 
which can possibly influence the nature of the dispersions obtained. 

Out of these, six, viz. Weber number, Reynolds number, volume fraction A, 
viscosity ratios, density ratios, and length ratios were finally used for correla¬ 
tion. The other four were used but found not to improve the correlation. 

The final correlations of Vermeulen et ah (V3) are: 

For liquid-liquid systems: 


N 2 p'L m d 5/3 

yJT 


= 0.016 


(160) 


where N equals angular velocity of the stirrer (revolutions per second or 
revolutions per minute) 

p’ = 0.4 p c + 0.6 p d 
L = paddle diameter (centimeters) 

f x = ratio of actual mean drop or bubble diameter to diameter at X = 0.1 
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For gas-liquid systems: 


N 15 dLp°- 5 n ° D - 75 

yfxv 025 


= 4.3 x 10~ 3 


( 161 ) 


The authors have also given a unified correlation applicable to both bub¬ 
bles and drops. 

Calderbank (C3) has given different correlations for drops and bubbles as 
well as for coalescing and noncoalescing systems. However, Jackson (Jl) has 
pointed out that it is unlikely that an absolute classification of this type is 
possible in practice, where speed of coalescence is continuously varying. 

Rodriguez et ah (R7) have presented correlations on the basis of power 
dissipation which can be useful for scale-up purposes. 

Both Vermeulen et ah (V3) and Calderbank (C3) conclude that the mean 
particle size is a function of impeller tip speed, whereas Rodriguez et ah (R7) 
find it to be a function of power input per unit volume. Jackson (Jl) explains 
this apparent discrepancy on the basis that the System of Rodriguez et ah 
(R7) was more coalescing in nature than the systems studied by the others 
(C3, V3). In a coalescing dispersion there is frequent circulation and redis¬ 
persion which requires impeller power. It is further pointed out (Jl) that, 
although the tip speed determines the mean particle size leaving the impeller, 
the particle size will also depend on the frequency of circulation which is a 
function of power input. 

Thus, here again a final correlation taking into account all the factors is 
still not available. 


XVIII. Bubble Formation under Intermediate Conditions and at 

Sintered Disks 

A. Bubble Formation under Intermediate Conditions 

When both the chamber pressure and the gas flow rate into the forming 
bubble are time dependent, the bubbles are said to be formed under inter¬ 
mediate conditions. The experiments conducted in this region yield results 
which in some respects are quite different from those obtained under con¬ 
stant flow or constant pressure conditions. A major difference is observed with 
respect to the influence of the depth of submergence on the bubble volume. 
Whereas the submergence has no influence under constant flow or constant 
pressure conditions, it has marked influence (PI) under intermediate condi¬ 
tions. 

The importance of chamber volume as a parameter was recognized by 
Hughes et ah (H7) and by Davidson and Amick (D5). Hughes et ah (H7) gave 
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conditions under which the chamber volume influence cannot be neglected. 
As the pressure inside the chamber fluctuates, there is a time lag between the 
instant of detachment and the start of the next bubble. Any attempt towards 
quantitative prediction must take this as well as the variation of flow into 
consideration. 

Kurana and Kumar (K20) have written the general equations taking the 
variation of flow into consideration for the two step mechanism of bubble 
formation. The fluctuations of pressure and the corresponding variations in 
flow are treated by an electrical analog. 

Their first and second stage equations, respectively, are 


and 


Vfb(p, - p g )g = 


[p, +(H/16 )/>,] 
12tt(3/4ti) 2/3 F% 3 



+ 7 zDy + 


3 pQ/b 

2(3/47t) 1/3 K# 3 


(162) 


V, i Pe + Pi ) (^) + @>( p 9 + Pi) v = V < - 6nr P( v + v e) +*Dy 


[Pg + (11/16)p,] 
127r(3/47r) 2/3 Kf /3 



(163) 


In the viscous resistance term in Eq. (163), the radius r can be taken as 
1.25 r fb without causing a great loss in accuracy. Equations (162) and (163) 
reduce to constant flow or constant pressure conditions, respectively, when Q 
is constant or when it is expressed in terms of orifice equation. 

To express the flow rate as a function of time, an electrical analog shown 
in Fig. 27 has been used. 

is a voltage source which represents the pressure source and has an 
internal resistance R t which is analogous to the pipeline resistances, etc. Volt¬ 
meter V represents a manometer. R T and L r are the tip resistance and induc¬ 
tance, respectively. E 2 is a voltage source connected in opposition to E t and 
represents the orifice submergence /?. Ammeter A is analogous to the flow 
meter, and the capacitor C represents the chamber. The capacitor has been 
connected in parallel and not in series because in the latter case it would act as 
an open circuit. The chamber in the actual set-up permits the flow of gas 
through it, in addition to storing a part of the inflow of gas in it during a part 
of the cycle and releasing the stored amount during the remaining part of the 
cycle. The capacitor in parallel behaves in the same manner. R v is a variable 
resistance representing the needle valve which controls the flow of gas into the 
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system. X is an element which represents the bubble itself. This element 
conducts only when the capacitor voltage (chamber pressure) has built up to 
a certain minimum value (P A + h + 2 y/r 0 ) and offers a variable resistance 
once it is conducting. The resistance of this element decreases as the bubble 
radius increases. The resistance of element X is actually a variable but for the 
purposes of calculations an average constant value has been taken which was 
determined experimentally. Further, the sum of the tip resistances and the 
resistance of element X can be neglected in comparison with the valve resis¬ 
tance R 0 . The loss in accuracy due to these assumptions is very small. The 
flow through the system can then be given by 

a(164) 

Equation (165) gives the gas flow rate into the bubble as a function of time. 

The bubble volume is calculated by solving Eq. (164) with Eqs. (162) and 

(163) through numerical methods. First, Eqs. (164) and (162) are solved 
through trial and error and then Eqs. (164) and (163) are solved by the Runge- 
Kutta method with the boundary conditions that when t = 0, dx/dt = v = 0, 
and x = 0. The condition of detachment is taken when x — r fb . 

The above method of calculation is time consuming and a simpler approxi¬ 
mate method has been evolved. This method makes use of the fact that Eq. 

(164) gives flow rate as the sum of a constant and a variable term. The final 
bubble volume can then be considered as the sum of the volume due to 
constant flow rate and the volume by which the capacitor expands before the 
bubble detaches. 

The first calculation of bubble volume is made at average constant flow 
rate by using Eqs. (54) and (61). As the formation cycle consists of two periods 
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(viz. the formation time during which the flow into the bubble occurs and the 
weeping time during which no flow into the bubble occurs), the average flow 
rate must be modified for the weeping periods. 

From the bubble size and the average flow rate, a first approximation of 
frequency (thereby total cycle time), as well as the limits between which the 
chamber pressure oscillates, is determined. The weeping time is calculated 
from the wave-form equations of chamber pressure, which are: 

During storage: 

(P ~ Po)I(Ph ~ Po) = 1 - e~ (,/RvC) (165) 

During discharge: 

(p'-P*)!(P'-p'H) = e-(" R ° c ) (166) 

The flow rate is then corrected by a ratio of the weeping time to the cycle time. 
Next the discharge time constant (R D C) of the capacitor is determined. If this 
is shorter than formation time, the entire capacitor expansion is added to 
bubble volume. If not, the same percentage of the capacitor expansion volume 
is added because the formation time is of the order of the capacitor discharge 
constant. The final value thus obtained forms the starting point for the next 
iterative step. This is continued till the values from two successive iterations 
are nearly the same. 

The predicted values by this method are shown in Fig. 28 where the in¬ 
fluence of flow rate on bubble volume for three submergences is shown. The 
agreement between theory and experiment is good and the theory predicts 
the variation of bubble volume with submergence satisfactorily. 

B. Bubble Formation at Sintered Disks 

Porous plates or sintered disks find extensive use in industry for gas-liquid 
contacting because they yield large numbers of very small bubbles. Under the 
usual operating conditions, sintered disks give bubbles with diameters rang¬ 
ing from 0.05 to 0.5 cm. Experiments (B7) on sintered disks reveal that the 
average bubble volume increases with increasing gas flow rate and liquid 
viscosity. Increases in surface tension and average pore diameter increase the 
bubble volume at low flow rates but have relatively negligible effects at higher 
flow rates. 

Bowonder and Kumar (B7) have proposed a model for this phenomenon 
based on the following assumptions: 

The resistance offered by the disk is so large that the bubble formation 
takes place under essentially constant flow conditions. 

A sintered disk contains an extremely large number of “potential sites” 
for bubble formation, only a fraction of which form the “operative sites.” 
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Fig. 28. Influence of flow rate on bubble volume for different orifice submergences 


The number as well as the location of the operative sites adjust themselves as 
the flow rate varies. This assumption has been experimentally verified. 

The flow rate through each operative site is the same, and no flow occurs 
through the “ non-operative sites.” 

The model further assumes that the number of effective sites is such that 
the bubbles formed from them as if from single nozzles are just sufficient to 
cover the whole area when arranged in hexagonal close packing. This assump¬ 
tion has been verified by measuring the average bubble size as a function of 
flow rate per site and comparing it with the results from an isolated nozzle. 
The hexagonal close packing arrangement is based on visual observation. The 
increase of bubble size with flow rate as well as the packing arrangement of 
bubbles is presented in Fig, 29. 

The above conditions may be expressed as 

where Vf ie denotes the bubble volume from an isolated nozzle at flow rate Q , 
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Fig. 29. Effect of flow rate on number of sites and the hexagonal dose packing arrange¬ 
ment for bubbles. 


and 


(Area of the Disk)(n) 

where n is the number of operative sites per unit area. 

The values of V F for any Q can be obtained from Eqs. (54) and (61) for 
constant flow conditions. 

As V F is implicitly expressed in terms of Q in Eqs. (54) and (61), iterative 
procedure must be used in solving Eq. (167). A value of n is first assumed and 
Q calculated. Corresponding to this Q , V F is evaluated by Eqs. (54) and (61). 
This value of V F is used in Eq. (167) to determine n. The calculated n is then 
compared with the n assumed initially. Depending on the difference between 
the two, a new n is assumed. This process is continued till the two agree. 

A typical set of data for an air-water system on a sintered disk of 3.61 
cm 2 area is presented in Fig. 30. The increase in average bubble volume and 
the decrease in effective number of sites with increasing flow rate are evident 
from Fig. 30. The solid lines correspond to those calculated on the basis of the 
model. The agreement between the theoretical and the experimental values is 
very satisfactory. 

The model has been tested over a viscosity range of 1 to 604 cp, surface 
tension from 22 to 72 dyn/cm, pore diameter from 45 to 110 microns, and 
flow rate per unit area from 1 to 100 cm 3 /sec. 

The success of the model, apart from helping in the evaluation of bubble 
volume from sintered disks, demonstrates the applicability of the simple 
“ single-site ” analysis to an obviously complex situation. 
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Fig. 30. Variation of bubble volume and the effective sites with flow rate. 


XIX. Concluding Remarks 

The single bubble formation at isolated nozzles is now understood both 
under constant flow and constant pressure conditions, though considerable 
work still needs to be done in the intermediate region where bubble size is 
highly influenced by resonance effects. The influence of geometric factors like 
nozzle geometry and orientation can also be predicted. For multiple orificed 
systems, a theoretical analysis is still not achieved, though the various influen- 



THE FORMATION OF BUBBLES AND DROPS 


363 


cing factors have been recognized. For a limited region of variables, the 
equations for single orifices can be applied to multiple nozzled systems also. 

The problem of spraying, atomization, and bubble formation in agitated 
systems still need considerable study, though quite a bit of work has already 
been reported on them. The formation in the above cases has mostly been 
studied in the absence of heat and mass transfer and chemical reaction, the 
presence of which can greatly influence the bubble volume. This has to receive 
considerable attention if the performance of pertinent industrial equipment is 
to be adequately explained. 

The very important field of bubble and drop formation in non-Newtonian 
fluids remains virtually untouched, even in the absence of heat and mass 
transfer and chemical reactions. 


Nomenclature 


a Dimensionless index; gas-liquid 
interfacial area per unit volume 
of dispersion, cm -1 [Hq. (124)] 
A Substitution given in the text; 
area of the noncircular orifice, 
cm 2 [Eq. (107)] 

A f Frontal area of the bubble, cm 2 
b Dimensionless index 
B Substitution given in the text 
c Dimensionless index: velocity of 
sound in the gas, cm/sec 
C Substitution given in the text; 
proportionality constant [Eq. 
(108); chamber capacitance [Eq. 
(166)] 

C x Substitution given in the text 
C 2 Substitution given in the text 
C D Drag coefficient dimensionless 
d Diameter of the bubble, drop, 
droplet, cm 

d Q Minimum droplet diameter, cm 
d m Diameter of maximum surviving 
bubble, cm 

d u Maximum droplet diameter, cm 
d L Linear mean droplet diameter, 
cm 

d s Diameter of static drop, cm 
Ad Range of size of droplets, cm 
D Substitution given in the text; 
diameter of the orifice, cm 


D e Effective diameter of a hypo¬ 
thetical equivalent circular ori¬ 
fice, cm [Eq. (107)] 

E Substitution given in the text; 

pressure, dyn/cm 2 [Eq, (164)] 

E 0 Pressure to which chamber 
charges just before it begins to 
discharge, dyn/cm 2 
E 2 Pressure that can be supplied by 
the source when the flow rate is 
zero, dyn/cm 2 

/ Function [Eq. (155)]; frequency, 
number/min 

f k Ratio of actual mean drop or 
bubble diameter to diameter at 
A = 0.1 

F Substitution given in the text; 

drag of the bubble, gm • cm/sec 2 
F b Force due to buoyancy in a still 
liquid, dyn [Eq. (118)] 

Fb Force due to buoyancy in a mov¬ 
ing continuous phase, dyn [Eq. 
( 118)1 

F d Drag force due to moving con¬ 
tinuous phase; total drag on the 
sphere, dyn [Eq. (97)] 
g Acceleration due to gravity, 
cm/sec 2 

G Volume of the liquid, ft 3 [Eq. 
(160)] 
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h Orifice submergence, cm 
H Substitution given in the text 
H g Volume fraction gas hold-up 
J Substitution given in the text 
k Constant [Eq. (159)]; parameter 
in Ostwald-de Waele model, di¬ 
mensions dependent on n [Eq. 
(96)] 

K Orifice constant, cm 7l2 /gm 112 ; 
correction to Stokes Law [Eq. 
(127)] 

L Paddle diameter, cm 
m Mass of the bubble, gm 
m' Mass of the liquid, gm [Eq. (67)] 
M Virtual mass of the bubble, drop, 
gm 

n Parameter in Ostwald-de Waele 
model, dimensionless [Eq. (96)]; 
number of effective sites per unit 
area of the sintered disk [Eq. 
(167)] 

N Substitution given in the text; 
angular velocity of the stirrer, 
revolutions sec [Eq. (160)] 

N Fr Froude number 
Np r Substitution given in the text 
N Fr Modified Froude group 
N Re Reynolds number 
N Re Substitution given in the text 
N Re Modified Reynolds number 
N WeCr Critical Weber number of maxi¬ 
mum bubble size capable of 
survival 

p Pressure of the liquid [Eq. (155)]; 
pressure outside a bubble, 
gm/cm • sec 2 ; chamber pressure 
at any time t during chamber 
charging period, dyn/cm 2 [Eq. 
(165)] 

p ' Chamber pressure at any time t 
during chamber discharging pe¬ 
riod, dyn/cm 2 

p h Sum of atmospheric pressure and 
submergence, dyn/cm 2 
Pi Pressure inside a bubble, gm/sec 2 
p H Applied pressure, dyn/cm 2 
P Substitution given in the text 
A p Pressure difference, gm/cm sec 2 
P' Substitution given in the text 


Pi Pressure in the drum, gm/cm • 
sec 2 

q Total flow rate through the sin¬ 
tered disk, cm 3 /sec [Eq. (168)1 
Q Volumetric flow rate, cm 3 /sec 
Q t Flow rate at any time, cm 3 /sec 
Q fb Volumetric flow rate at the end 
of the force-balance-bubble stage, 
cm 3 /sec 

r Radius of the bubble, drop, cm 
r F Final radius of the bubble, cm 
r fb Radius of the force-balance 
bubble, cm 

r st Radius of the maximum stable 
droplet, cm 

R Radius of the orifice, cm; ratio 
of the area of a noncircular 
orifice to that of the inscribed 
circle [Eq. (106)] 

R 0 Radius of the residual bubble, 
cm 

R d Discharge resistance,(dyn/cm 2 )/ 
(cm 3 /sec) 

R t Tip resistance, (dyn/cm 2 )/(cm 3 / 
sec) 

R v Valve resistance, (dyn/cm 2 )/ 
(cm 3 /sec) 

Re 0 Orifice Reynolds number [Eq. 

( 122 )] 

Re' Particle Reynolds number; modi¬ 
fied Reynolds number [Eq. (98)] 
s Distance, cm 
t Time, sec 

t c Time of detachment, sec 
t e Time relating to the first stage, 
sec 

T Substitution given in the text 
u Velocity of the continuous phase, 
cm/sec 

U 0 Fluidizing velocity, cm/sec 
v Velocity, cm/sec 
v 0 Velocity of the flow of gas 
through the orifice, cm/sec [Eq. 
(62)] 

v' Velocity of the center of the 
bubble, cm/sec 

v’c Velocity of the dispersed phase 
through the nozzle, cm/sec [Eq. 
(127)] 
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Velocity relating to the first 
stage, cm/sec 

A 

Symmetrical “rate of deforma¬ 
tion” tensor [Eq. (96)] 

Vj 

Velocity of the jet, cm/sec 

6 

Contact angle, degrees; substi¬ 

Vs 

Superficial gas velocity, ft/sec 


tution (D8) 

V 

Volume, cm 3 

A 

Volume fraction of the dispersed 

Vo 

Volume of the residual bubble, 


phase 


cm 3 

P 

Micron (10 _4 cm); viscosity of 

V, 

Bubble volume at any time, cm 3 


the liquid, poise [Eq. (155)] 

Vf 

Final volume of the bubble, 
drop, cm 3 

Pc 

Viscosity of the continuous 
phase, poise; centipoise [Eq. 

V'f 

Bubble volume in a liquid mov¬ 


(127)] 


ing with a velocity w, cm 3 

N 

Dispersed-phase viscosity, poise 

v» 

Volume of the force-balance 

P'9 

Viscosity of the gas, poise 


bubble, cm 3 

Pi 

Viscosity of the liquid, poise 

(Vf), 

Bubble volume from the in¬ 

Pw 

Viscosity of water, poise 


scribed circle, cm 3 

Pd 

Viscosity of the dispersed phase, 

( V F )„.c. 

Bubble volume from the non¬ 


poise [Eq. (161)] 


circular orifice, cm 3 

P 

Density, gm/cm 3 ; density of the 

V K 

Negative volume resulting from 
buoyancy, cm 3 


continuous phase, gm/cm 3 [Eq. 
(159)] 

v m 

Excess volume over and above 
static bubble volume, cm 3 [Eq. 

P' 

Effective mean density, gm/cm 3 
[Eq. (160)] 


(109)] 

Ap 

Density difference between the 

Vs 

Static bubble volume, cm 3 [Eq. 


liquid and the gas, gm/cm 3 


(110); static drop volume apply¬ 
ing Harkins’ correction, cm 3 

Pc 

Density of the continuous phase, 
gm/cm 3 


[Eq. (125)] 

Pi 

Density of the dispersed phase. 

V v 

Volume representing the force 


gm/cm 3 


experienced by the moving drop, 
cm 3 

Pq 

Density of the gas, gm/cm 3 ; lb/ 
ft 3 [Eq. (123)] 

w 

Power dissipated by impeller, 

Pi 

Density of the liquid, gm/cm 3 


horsepower [Eq. (159)] 

T 

Proportionality factor 

X 

Distance, cm 

T 

Shear stress, dyn/cm 2 [Eq. (96)] 

X, 

Function of the parameter, n 

<t> 

Substitution given in the text; 

z 

Substitution given in the text 


angle between the orifice plate 
and the horizontal, degrees; 

Greek Letters 


correction factor of Harkins and 
Brown [Eq. (132)] 

y 

Surface tension; interfacial ten¬ 
sion, dyn/cm 

<A 

Substitution given in the text; 
Function of diameters, [Eq. (125) 
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Adhesion, electrostatic charge and, 30-38 
Adjacency matrix 
Boolean, 194-195 
branch in, 213 
graph and, 207 
maximal loops in, 200-202 
paths and other loops in, 205 
for process system, 220 
tearing and, 213-214 
tree of paths in, 213-214 
Aerosols 

charge measurement in, 79-81 
Coulomb forces in, 10 
deposition rates in, 28-29 
drop coalescence in, 25-26 
electrostatic effects in, 2, 10-25 
flocculation coefficient in, 15 
flocculation rate in, 17 
floe properties of, 21-23 
monodisperse homopolar, 16 
optimum charge in, 14 
in respiratory tract, 26 
stability of, 15-21 
Air, properties of, 87 
Alcohol dehydration, 166-167 
confidence region in, 129 
ethanol adsorption constant and, 141,146 
model of, 107-108 
parameter estimation in, 120 
sums-of-squares surfaces in, 118-119 
Algorithm, decomposition, 196-200 
see also Decomposition; Steward’s algo¬ 
rithm 

Ambipolar ion, 49 
Analysis of variance 
in model adequacy tests, 131-137 


F statistic in, 133 
regression in, 133-135 
Atomic Energy Commission, example of 
decomposition procedure for, 226- 
236 

Atomization 

charge-to-mass ratio in, 40-42 
corona in, 41-42 
electrostatic, 38-43 
in high vacuum, 38-40 
in liquid-gas system, 348-349 
in liquid-liquid system, 347-348 
of oils, 40-41 
spraying and, 347-349 
Avogadro’s number, 87, 89 


B 

Back discharge, 46-47 
Balloelectricity, 55 
Billingsley’s algorithm, 208 
Boltzmann constant, 87, 89 
Boolean adjacency matrix, 194-195 
Boolean matrix, in process models, 188-189 
Boolean multiplication, 189-190 
Boolean union, 190-191 
partitioning and, 202 
Branch, in equation systems, 213 
Brownian motion, 11 
Bubble expansion of at orifice, 304-306 
Bubble and drop formation, 255-363 
see also Bubble formation; Drop forma¬ 
tion 

in dispersed phase, 257 
interface surface in, 256 
unified model for, 350-354 
Bubble chamber, volume of, 268-270 
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Bubble formation 
bubble volume and, 257-258 
chamber volume, and 268-270, 356-357 
in constant-flow conditions, 277-304, 
327-330 

under constant pressure, 304-316,330-332 
continuous-phase velocity and, 277 
Davidson and Schuler's model of, 278- 
279, 285, 289-290, 307-310, 319 
density and, 302-304 
electric analogue in, 357 
experimental equipment for, 265-266 
flow rates in, 276-277, 286, 290-291, 361 
frequency of, 263-265 
in gas fluidized beds, 318-320 
gas properties of, 274-277 
in highly viscous liquids, 289-295, 308- 
310 

idealized sequence in, 282 
at inclined orifice, 330 
under intermediate conditions, 356-359 
in inviscid fluids, 278-289, 307-313 
Kumar and Kaloor’s model of, 280-281, 
285, 292-298, 310-318, 320 
liquid density and, 273-274, 302-304 
liquid viscosity and, 272-273 
models of, 278-281, 285, 289-298, 307- 
320 

at multiple-orificed plates, 333-334 
in non-Newtonian fluids, 316-320 
orifice characteristics and, 267-268 
orifice geometry and, 321-324 
orifice orientation and, 324-332 
in Ostwald-de-Waele fluids, 316 
particle size and, 319 
at sintered disks, 359-362 
submergence effects on, 270-271 
surface tension and, 286-289, 298-301, 
307 

system variables in, 267 
Tate’s law and, 271 
upward buoyance force in, 282-283 
viscosity and, 278-295, 307-313 
wetting properties, and 271-272 
Bubble size 

continuous-phase velocity and, 332-333 
experimental set-up in, 265-266 
factors in, 265-277 

flow rate and, 276-277, 286, 290-291, 361 
orifice geometry and, 321-324 


in stirred vessels, 354-356 
weeping time and, 359 
Bubble volume 
cinephotography in, 258-259 
direct methods in, 258-260 
flow rate and, 314, 362 
gamma-ray absorption method in, 259- 
260 

indirect methods for, 260-265 
liquid properties and, 278 
measurement of, 257-265 
orifice orientation and, 325-326 
rigorous calculations for, 287 
simplified calculations for, 287 
submerged orifice and, 360 
surface tension and, 274-276, 287-289, 
298-301,312-313 
viscosity and, 301-302 
X-ray cinephotography in, 259 
Bulk fluid, rupture of, 257 
Buoyancy force, bubble volume and, 298- 
299 

Butanol dehydration, 146 
Butyl alcohol, dehydration of, 143 


C 

Cab-O-Sil, 68 

Capacitance technique, in bubble formation 
measurement, 264-265 
Carbon tetrachloride, dispersion of, 41 
Carbopol, 71 
Carbowax, 68 

Chamber volume, bubble formation and, 
356-357 
Charge 

electrostatic, 2-3; see also Electrostatic 
charge 

in metal-metal contact, 62 
phase change and, 73-74 
surface gradient and, 5 
Charged bodies, defined, 2 
Charged particles 
collision rate of, 15 
Coulomb’s law and, 30-31 
electrostatic force between, 30-35 
Charged spheres 
attractive force between, 33 
electrostatic factors in, 32 
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Charge measurement, techniques of, 75-87 
Charge separation, in liquid-gas interface, 
55-56 

Charge transfer, mechanism of, 61-62 
Charge-to-mass ratio, 4, 25, 27 
atomization and, 39, 42 
for electron and proton, 87 
measurement of, 77, 81 
Charging 

through condensation, 74 
contact, 46-67 
induction or field, 47-51 
and interface alteration, 56 
metal-to-metal contact, 62 
by visible light, 74 

Chemical engineering, models in, 98-99 
see also Model(s) 

Chemical plant, modeling of, 99 
Chemical reactions, mathematical modeling 
of, 97-181 

Chemical warfare, aerosols in, 14 
Cinephotography 

in bubble measurement formation, 263- 
264 

in bubble volume measurement, 258-259 
Cloud 

as aerosol, 10 
charged, 12 

corona charge loss in, 13 

edge velocity of, 11-12 

expansion of, 11-14 

maximum permissible diameter of, 13 

properties of, 11-14 

Coalescence, electrostatic charge and, 25-28 
Coal-iron contact, charging in, 71 
Coehn’s rule, 66 

Compressive contact, charging in, 67-68 
Computer 

in model discrimination and parameter 
estimation, 178 

in partitioning and disjoint subsystem 
work, 211 

Sargent’s algorithm and, 221 
Computer program, for precedence ordering 
in large-scale equation systems, 
237-251 

Condensation, charging through, 74 
Confidence region 

in model-discrimination designs, 176— 
177 


in nitric oxide reduction, 168-169 
plotting of in parametric estimation, 
129-131 

in reaction-rate modeling, 124-129 
Constant-flow conditions, bubble formation 
in, 277-304 

Constant-pressure conditions, bubble for¬ 
mation in, 304-316 
Contact charging, 46-47, 62 
Contact electrification, 60 
Continuous-phase velocity, in bubble for¬ 
mation, 277 

Conversion factors, for physical units 
(table), 84-86 
Corona 

atomization and, 41^12 
negative, 45 
point-to-plane, 44 
positive, 45 

Coulomb force, in aerosol, 10 
Coulomb’s law, 6 

interparticle electrostatic force and, 30- 
31 


D 

Data location, in model designs, 168 

Davidson and Harrison model, of bubble 
formation, 319-320 

Davidson and Schuler model, bubble for¬ 
mation, 278-279, 285, 289-290, 307- 
310, 319 

Decomposition 

Hanford N-reaction plant examples of, 
226-236 

mathematical definition of, 187 
partitioning in, 198-208 
precedence ordering techniques in, 222- 
226 

tearing in, 211-220 

Decomposition algorithms, criteria for, 
199-200 

Decomposition procedure, for solving large- 
scale systems of equations, 185-253 

Density, bubble formation and, 273-274, 
302-304 

Dependent variable, in reaction-rate models, 
159-160 

Deposition, electrostatic charge and, 28-29 
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Diagnostic parameters 
adaptive model-building with, 147-156 
for hyperbolic models, 147-152 
model discrimination and, 142-147 
for power-function models, 152-156 
Dibutyl phthalate, atomization of, 42 
Differential models, 101-102 
Differentiation, in power-function models, 
103 

Diffusion, force equation for, 37 
Diffusion charging, 51-53 
Diffusion rate, in aerosols, 17 
Digraph, matrices and, 188-189 
Disjoint subsystems, identifying of, 209-211 
Dispersion, through submerged orifices, 257 
Drag coefficient, in bubble formation, 305 
Drop and bubble formation, unified model 
for, 350-354 

Drop formation, 334-343 
see also Bubble formation 
dispersed phase viscosity and, 339 
Hayworth and Treybal model of, 335-336 
Kumar and Kuloor model of, 337-339 
in non-Newtonian fluids, 343-346 
Null and Johnson model of, 337 
other models of, 340-343 
in power-law fluids, 343-345 
in stirred vessels, 354-356 
variables in, 334-335 
for vertically oriented orifices, 346-347 
Drop volume 

dispersed-phase viscosity and, 342 
interfacial tension and, 341 
viscosity and, 340-342 
volumetric flow rate and, 341 
Dual-site model, 147, 167 


E 

Electrical endosmosis, 58, 66 
Electrification, contact and static, 60 
Electroconstriction, 8 
Electroendosmotic experiments, 66 
Electron 
charge on, 90 
energy level of, 60-61 
Fermi level of, 61-62 
ionization by, 44 
in solid state, 61 


in thermionic emission charging, 53-54 
Electronic probe, in charge measurement, 82 
Electron point charge, 34 
Electron transfer, through rubbing, 60-62 
Electron tunneling, 63 
Electrostatic charge 
see also Charge; Particle 
adhesion and, 30-38 
biological processes and, 38 
defined, 2-3 
deposition and, 28-29 
effects of, 7, 25-28 
evaporation and, 28-29 
field intensity and, 74 
in liquid-solid contact, 58-60 
and material properties, 7-8 
maximum stable, 43-46 
measurement of, 75-87 
methods of charging or discharging 
particles, 43-75 
nomenclature in, 88-90 
powder dissemination and, 43 
in solid-solid contact, 60-72 
Electrostatic dissemination techniques, 38- 
43 

Electrostatic effect 
on aerosol properties, 10-25 
on containment or bulk storage, 8 
on material properties, 28-29 
Electrostatic field 
high intensities of, 9 
polarization in, 3 
Electrostatic force 
between charged particles, 30-35 
comparison with other forces, 35-38 
Electrostatics, fundamentals of, 2-6 
Empirical model, 102 
response-surface methodology in, 155- 
159 

techniques for, 156-167 
transformation of variables in, 159-164 
tuning in, 164-167 
Endosmosis, 58-59, 66 
Energy level, solid state and, 60-61 
Equation block, 211 
adjacency matrix and, 214 
tearing methods for, 212-220 
Equations 

adjacency matrix and, 200-202 
decomposition methods for, 185-253 
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information flow among, 187 
irreducible sets of, 200-202 
large-scale systems of, 185-253 
maximal loops and, 200-202 
occurrence matrix and, see Occurrence 
matrix 

output set for, 196-198 
partitioning in, 198-208 
subsystems of, 187 
systems of, 193-196 
Equilibrium charge, 48 
Equilibrium vapor pressure, 7 
Evaporation, electrostatic charge and, 28-29 
Experimental designs, in model making, 
168-174 


F 

Faraday cage, 79 

Fermi-Dirac statistics, 61-62 

Fermi level, 61, 63 

Field charging, 47-51, 74 

Field intensity, of point charge, 34, 74 

Flocculation 

aerosol properties in, 21-23 
effective size in, 23 
Flocculation coefficient, 15 
Flocculation rate, in aerosols, 17 
particle charge and, 20 
particle mass concentration and, 19 
particle number concentration and, 18 
Floe diameter, effective size and, 23 
Flow rates 

bubble formation and, 286, 290-291 
bubble size and, 361 
bubble volume and, 314, 362 
Fluid, rupture of, 257 
Fluid-fluid contact, interface in, 256-257 
Force balance bubble volume, 312, 317 
Force equations, for various mechanisms, 
36-37 

Force field, particle stability and, 24 

FORTRAN 

precedence ordering techniques and, 223 
Steward’s algorithm and, 206 
Fractional-life methods, in power-function 
models, 103 

Freezing, charging through, 73-74 


Froude number, in bubble formation, 305- 
306 

F statistic, in analysis of variance, 113 


G 

Gamma-ray absorption, in bubble volume 
measurement, 259-260 

Gas-flow rates, and bubble formation, 276- 
277 

Gas properties, bubble formation and, 274- 
276 

Gas sample collection, in bubble volume 
evaluation, 261 

Graph 

adjacency matrix and, 207 
properties of, 188-190 
maximal loop of, 192 
of reactor process, 219 

Ground fog, measurement of, 83 


H 

Hanford N-reactor plant 
decomposition examples from, 226-236 
reordered occurrence matrix of, 252 
Hayworth and Treybal model, of drop for¬ 
mation, 335-336 

Hexyl alcohol, dehydration of, 166 
Homopolar charging, absence of in pow¬ 
ders, 72 

Hougen-Watson model, 105 
Hyperbolic models, 105-110 
estimated parameters in, 109-110 
of methane oxidation, 148-152 
reaction rate and, 105-106 
uncertainty in, 125 


I 

Independent variables 
plots of in model adequacy analysis, 
139-140 

transformations of in empirical modeling, 
161-162 

Induction charging, 47-51 
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Information flow 

Boolean adjacency matrix and, 194 
among equations, 187 
maximal loop in, 200-202 
Insecticide spray, optimum charge in, 14 
Insulators, charging of, 62-63, 66 
Integral models, 101-102 
Integration, in power-function models, 102- 
103 

Interface, in fluid-fluid contact, 256-257 
Interface alteration 
particle charging by, 55-72 
processes and mechanisms in, 56 
Intrinsic parameter, in model discrimina¬ 
tion, 144-147 

Inviscid liquids, bubble formation in, 307- 
308, 210-313 

Inviscid systems, oriented orifice and, 327- 
330 

Ion, ambipolar, 49 

Ion concentration, in interface alterations, 
57 

Ion generator, particle charge and, 53 
Ionization, in atomization, 44 
Ionization coefficient, 45 
Ion mean free path, particle charge and, 49 
Ion mobility, factors affecting, 50 
Ion transfer, in insulators, 66 
Iron-coal contact, charging in, 71-72 
Isooctene hydrogenation, parameter estima¬ 
tion in, 115 

Iterate column, in occurrence matrix, 218 
Iterative techniques, in parameter estima¬ 
tion, 115-116 


J 

Jet breakage, in droplet formation, 349 


K 

Kinetic data, in empirical model tuning, 164 
Kinetic modeling, response-surface me¬ 
thods in, 156-158 

Kumar and Kuloor model, for bubble for¬ 
mation, 280-281, 285, 292-298, 310- 
318, 320, 325-332 
for drop formation, 337-339 


L 

Langmuir-Hinshelwood mechanisms, 100 
Large-scale systems of equations, decompo¬ 
sition procedures for, 185-253 
see also Equations 
Least-squares method 
in model parameter estimation, 111-121 
nonlinear, 115-121 
weighted, 114-115 
Ledet’s algorithm, 216-218, 221 
tearing and, 225-226 
Lee’s algorithm, 226 
Legendre polynomials, 24 n 
Light, charging by, 74 
Lindemann theory, 100 
Linearization method, in parameter esti¬ 
mation, 116 

Linear-least-squares method 
in model adequacy tests, 138 
in model parameter estimation, 111-114 
Linear models, confidence intervals in, 125- 
127 

Liquid, electrostatic atomization of, 38-43 
Liquid breakup, charge nature and magni¬ 
tude in, 57-58 

Liquid density, bubble formation and, 273- 
274, 302-304 

Liquid-gas interface, charge separation and, 
55-56 

Liquid-liquid contact, interface in, 256-257 
Liquid properties, bubble volume and, 278 
Liquid-solid contact, particle charge in, 
58-60 

Liquid viscosity, bubble formation and, 
272-273, 289-296, 301-302 
Loops, occurrence matrix and, 215 


M 

Mathematical modeling 
see also Model(s) 
experimental designs, for 168-178 
Matrix 

see also Adjacency matrix; Occurrence 
matrix 

Boolean multiplication in, 189-190 
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nonzero elements in, 194-195, 203, 210, 
229 

occurrence and adjacency, 194-196 
Maximal loop, 192, 200 
matrix of, 204 
terminal equation and, 208 
Mechanisms, models and, 98 
Mercury, charge acquired by, 59-60 
Metal-metal contact, charging in, 62 
Methane oxidation, hyperbolic model of, 
148-152 

Michaelis-Menten mechanisms, 100 

Millikan cell, 76-77 

Model(s) 

adequacy tests for, 131-141 
analysis-of-variance techniques for, 131— 
137 

empirical, 156-157 
experimental designs in, 168-174 
of bubble formation, 278-281, 289-290, 
292-298,302-303, 307-320, 325-332, 
350-354 

computer use in, 178 
confidence intervals and regions in, 124- 
129 

conjecture and experiment in, 168 
data points in, 168 
diagnostic parameters in, 142-156 
discrimination in, 169-173 
disjoint subsystem for, 209 
of drop formation, 337-339 
hyperbolic, 105-110, 147-152 
integral and differential, 101-103 
kinetic, 156-158 

least-squares estimation of parameters in, 
111-121 

linear and nonlinear, 125-129 
linearly reducible, 102-110 
mechanisms and, 98-99 
multiple-response parameter estimation 
in, 192-131 

parameter estimation in, 111-131, 173- 
178 

power-function, 102-104 
process, 98-99 
reaction-rate, 99-101 
reparametrization in, 121-124 
residual analysis of, 137-141 
response-surface methods in, 155-159 
rival, 169-172 


single- versus dual-site, 106, 147 
Model-building, adaptive, 147-156 
Model-discrimination 
designs for, 171-173 
with diagnostic parameters, 142-147 

N 

Naphthalene oxidation data, 122-123 
Naphthalene oxidation system, parameter 
estimation in, 119-120 
Nebulizer, 57 

Needle, drop coalescence from, 25-26 
Newtonian fluid 
flow index in, 316 
power-law drops in, 345-346 
Newton’s iteration function, 186 
Newton’s second law, in bubble formation, 
304 

Nitric oxide reduction, data location in, 
168-169 

Nonintrinsic parameters, in model discrimi¬ 
nation, 142 

Nonlinear least squares, method of, 115-123 
Nonlinear models, confidence intervals in, 
127-129 

non-Newtonian fluids 
bubble formation in, 316-320 
drop formation in, 343-345 
Nonzero element 

in occurrence matrix, 194-195, 210 
in product matrix, 203, 229 

O 

Occurrence matrix, 193 
adjacency matrix and, 195 
best tear in, 216-217 
disjoint subsystems and, 209-210 
Hanford N-reactor system and, 227, 
229-230, 252 
iterate column in, 218 
nonzero elements in, 210 
output variables and, 234 
reduced, 222 
residual, 222, 252 
torn variables in, 217 
for triple-effect evaporator system, 230- 
231 
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Oils, atomization of, 40 
Orifice 

bubble expansion and, 304-306 
bubble formation and, 267-268, 324-332 
drop formation from, 346-347 
multiple, 333-334 
wetting properties and, 271-272 
Orifice geometry, bubble size and, 321-324 
Orifice orientation, bubble formation and, 
324-332 

Orifice submergence, in bubble formation, 
270-271, 360 

Ostwald-de-Waele fluids, bubble formation 
and,316 
Output set, 193 
finding of, 196-198 
Output variable, 193 

Overall plots, in residual analysis of models, 
137-138 


P 

PACER digital executive program, 187 
Parameter estimation 

analysis of variance and, 132 
computer and, 178 
methods in, 170-171, 178 
model discrimination and, 173-178 
multiple-response, 129-131 
weighted least-squares method in, 114— 
115 

Parametric residuals, in model adequacy 
tests, 140-141 
Partial pressures 

feed-component, in hyperbolic models, 
147-148 

in model-discrimination design, 173 
Particle(s) 

charging or discharging methods for, 
43-75 

contact charge of, 46-47 
diffusion charging of, 51-53 
field charging of, 47-51 
interface alteration charge in, 55-72 
Stokes-Cunningham factor for, 11 
thermionic emission charging of, 53-55 
Particle charge 

see also Charge; Charging; Electrostatic 
charge 


back discharge in, 47 
and charge-to-mass ratio, 27 
contact charging and, 46-47 
container diameter for, 10 
flocculation rate and, 20 
ion solubility and, 49-50 
measurement of, 75-87 
Particle concentration, spacing and, 21 
Particle mass concentration, flocculation 
rate and, 19 

Particle number concentration, 18 
Particle size 

bubble formation and, 319-320 
charge and, 3-4 

Particle spacing, concentration and, 21 
Particle stability, electrostatic charge and, 
24 

Particle surface gradient, 5 
Particulates, electrostatic phenomena with, 
1-90 
Partitioning 
Boolean union in, 202 
computation times for, 224 
methods of, 202-208 
of system equations, 198-208 
Pauli exclusion principle, 60 
Pentane isomerization 

parameter estimates for, 113 
rate models for, 158 
Phase change, charging by, 73-74 
Photoelectric cell counter, in bubble- 
frequency measurements, 263 
Photography, see Cinephotography 
Photoionization, charging through, 75 
Physical units, conversion factors for (table), 
84-86 

Planck’s contant, 87-88 
Pneumatic conveying contact, charging in, 
68 

Point change, force and field identity due to, 
34 

Polarization 

between charged spheres, 33 
in electrostatic field, 3 
Powdered milk, charging in, 69-70 
Powders 

electrostatic dissemination of, 43 
homopolar charging in, 72 
pneumatically conveyed, 68 
Power-function models, 102-104 
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diagnostic parameters for, 152-156 
multicomponent, 104 
Precedence ordering 
computer program for, 237-251 
decomposition techniques for, 222-226 
Predicted-value plots, in model analysis, 
138-139 

Pressure pulse technique, in bubble-fre¬ 
quency meaurements, 264 
Process exploitation, response-surface me¬ 
thods in, 157-159 

Process flow sheet, information in, 188 
Process models, 98-99 
information flow in, 188-196 
Process system 
adjacency matrix for, 220 
design parameters in, 220 
input-output relationships in, 219 
Process units, tearing in, 219-222 
Propositions, in symbolic logic, 189 
Propylene hydrogenation, 139-140, 163 
discrimination design in, 174-175 
Proton, mass of, 89 

Q 

Quadrupole mass spectrometer, 77 

R 

Ramakrishnan model, of bubble formation, 
295-298 

Rau and co-workers model, of drop forma¬ 
tion, 337-339 
Rayleigh instability, 24-25 
Reachability matrix, 192 
Reaction driving force, in empirical model¬ 
ing, 163 

Reaction order, in empirical modeling, 161 
Reaction rate 

in empirical model tuning, 165-166 
in hyperbolic models, 105 
in response-surface methodology, 155-156 
Reaction-rate models, 99-101 
confidence intervals and regions in, 
124-129 

diagnostic parameters in, 151 
transformation of variables in, 159-161 
Reactor process, graph of, 219 


Regression, in analysis-of-variance tech¬ 
niques for model adequacy, 133-137 
Relaxation time, defined, 9 
Reparametrization, in chemical models, 
121-124 

Reservoir method, in bubble volume evalua¬ 
tion, 260 

Residual analysis, in model adequacy tests, 
137-141 

Residual plots, in model analysis, 139 
Respiratory tract, aerosols in, 26 
Response surface 
in empirical modeling, 155-159 
in process exploitation, 157-159 
Reynolds number, in bubble formation, 
306, 316 

Riot control, aerosols in, 14 
Rolling contact, charging in, 70-71 
Rotameter, in bubble volume measurement, 
263 

Rubbing 

electron transfer through, 60-62 
of powders, 68-70 

Runge-Kutta method, in bubble volume 
calculations, 317 


S 

Sargent’s algorithm, 220-221, 226 
Satyanarayan model, of bubble formation, 
310-318 

Shock waves, charging through, 75 
Silica, charging of in pneumatic conveyors, 
68-69 

Simultaneous equations, partitioning in, 
198-205 

see also Equations 
Single-site model, 106, 109, 147 
Sintered disks, bubble formation at, 359-362 
Sliding contact, charging in, 69 
Soap-film flow meter, 262 
Solidification, charging through, 73-74 
Solid-solid contact 
charge in, 60-72 
compressive, 67-68 
pneumatic, 68-69 
rolling, 70-71 
sliding, 69-70 

solid state structure in, 60-61 
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Solid-solid reaction, in empirical modeling, 
160 

Solid state structure, theory of, 60-61 
Spray electrification, 55 
Spraying and atomization, 347-349 
see also Atomization 
Standard air, properties of, 87 
Static electrification, 60 
Steepest-descent method, in parameter esti¬ 
mation, 116 

Steward’s algorithm, 205, 216, 221, 223 
output set of, 196-198, 224 
second phase of, 206 
Stokes-Cunningham factor, 11 
Stokes terminal velocity, bubble velocity 
and,325 

Stirred vessels, drop and bubble size in, 
354-356 

Streaming potential, in liquid-solid contact, 
58 

Submerged orifices, dispersion through, 257 
Submergence, in bubble formation, 270- 
271,360 

Sums-of-squares, surfaces parameters in, 
118-119 

Surface states, charging and, 71-72 
Surface tension 

bubble formation and, 271-272, 286-289, 
298-301, 307, 327 

bubble volume and,274-276,300,312-313 
in constant-flow conditions, 274 
orifice geometry and, 323-324 
pressure due to, 7 
vapor pressure and, 7 
Symbolic logic, propositions in, 189 
Systems equations 
branch in, 213 

Hanford N-reactor plant examples of, 
226-236 

partitioning of, 198-208 
representation of, 193-196 
tearing in, 211-220 

System variables, for triple-effect evaporator 
system, 234-235 

T 

Tate’s law, 271 
Tearing 

algorithms in, 219-222 
concept of, 211-212 
as decomposition process, 211-222 


for large-scale systems of equations, 
212-220 

Ledet’s algorithm and, 225 
loops in, 215 

Terminal equation, maximal loop and, 208 
Thermal equilibrium, electron and, 61 
Thermionic emission charging, 53-55 
Time-residual plots, in model adequacy 
tests, 140 

Torn variables, in occurrence matrix, 217 
Tree, in adjacency matrix, 213-224 
Triboelectric series, table of, 64-65 
Triboelectrification, 60 
Triple-effect evaporator, in Hanford N- 
reactor example, 230-235 
Truth values, in symbolic logic, 189 
Tunnelling, electron, 63 

U 

Union, Boolean, 190-191 
Universal constants (table), 87 

V 

Vacuum, atomization in ,38-40 
Van der Waals force, equation for, 37 
Vapor pressure, equilibrium, 7 
Variables, transformation of in empirical 
modeling, 159-164 
Viscosity 

bubble formation and, 272-273, 289-290 
bubble volume and, 301-302 
Viscous liquids 

bubble formation and, 272-273, 289-296 
oriented orifices in, 327-330 
Volumetric flow rate, bubble volume and, 
260-263 

W 

Water drops, coalescence of, 26 
see also Drop formation 

X 

X-ray cinephotography, in bubble volume 
measurement, 259 

Z 

Zeta potential, in liquid-solid contact, 58 



